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SEQ ID NO. 4201: 2603 V/R STRAIN 

ATGGTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGGAATAAAGCTAACCTTTTC 
ACTGG ATGGG CTGACGT AG ATCTTT CAGAAAAAGGTACACAACAAGCTATTG ATG CTGG G 
AAATTAATT CAAGCAGCAGGTATTGAGTT CGAC CTTGCTTTTACAT CAGTTCTT AAACGT 
GCCATCAAAACAACTAACCTTGCCCTTGAAGCAG CTGATCAACTTTGGGTACCAGTTGAA 
AAAT CATGGCGCTTGAACG AACGT CATTACGGTGGATTG ACAGGAAAAAATAAAG CAGAA 
GCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGTCGTTCATATGATGTATTG 
CCT CCAG ATATGGCTAAAGATG ATG AACATT CAG CACATACTGATCGT CG CT ATG CTTCA 
CTAG ATG ATT CTGTTATT C CAGATG CAGAAAACCT AAAAGTT ACTTTAG AG CGTGCT CTT 
C CTTT CTGGGAAGAT AAAATTG CT C CTG CT CTTAAAG ATG GT AAAAATGTGTTTGTTGGT 
GCACACGGT AACTCAAT CCGTG CT CTT GTAAAACATAT CAAACAATTGT CAG ATG ATGAA 
ATCATGGACGTTGAAATTCCTAA(2TTCCCACCACTTGTTTTCGAATTTGATGA 
AAC CTTGTTT CAGAATATT ACTTAG GTAAA 

SEQ ID NO. 4202: 090 STRAIN 

GTAAAATTAGTATTCG CACG CCACGGTGAATCTGAGTG 
GAATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAA 
AAGGT ACACAACAAG CT ATTGATG CTGGG AAATTAATTCAAG CAG CAGGT 
ATTGAGTT CGAC CTTGCTTTTACAT CAGTTCTTAAACGTGCCATCAAAAC 
AACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAA 
AAT CATGGCG CTTG AACGAACGTCATT ACGGTGG ATTGACAG GAAAAAAT 
AAAG CAG AAG CAGCTGAACAATTT GGTGATGAG CAAGTTCATATTTGG CG 
T CGTT CATAT GATGT ATTGCCT CCAG ATATGG CTAAAGATGATG AACATT 
CAG CACATACTGATCGTCG C TATG CTT CACTAGATGATT CTGTTATT CCA 
GATGCAGAAA^CCTAAAAGTTACTTTAGAGCGTGCTCTTC CTTT CTGGG A 
AGATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTG 
CACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCA 
GATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCA.CTTGTTTT 
CG AATTTGATGAAAAATTAAAC CTTGTTT CAGAATATTACTTAGGTAAA 

SEQ ID NO. 4203: A909 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG 

AATAAAGCTAACCTTTTCACTGGATGGG CTGACGT AG AT CTTTCAGAAAA 

AGGT AGA.CAACAAGCTATTG ATGCTGGGAAATTAATT CAAG CAG CAGGTA 

TTGAGTT CG ACCTTG CTTTT ACAT CAGTT CTTAAACGTG C CATCAAAACA 

ACTAACCTTG CC CTTGAAGCAGCTG AT CAACTTTGGGT ACCAGTTG AAAA 

ATCATGGCGCTTAA^CGAACGTCATTACGGTGGATTGACAGGAAAAAATA 

AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT 

CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC 

AGCm^TACTGATCGTCGCTATGCTTC^CTAGATGATTCTGTTATTCCAG 

ATGCAGAAAAC CT AAAAGTT ACTTTAG AG CGTG CT CTTCCTTTCTGG GAA 

GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC 

ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG 

ATGATGAAATCATGGACGTTGAAATTCOTAACTTCCCACCACITGTTTTC 

GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA 

SEQ ID NO. 4204: H36B STRAIN 

GTAAAATTAGTATTCG CACGC CACG GTGAATCTGAG 

TGGAATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGA 
AAAAGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAG 
GTATTGAGTTCGACCTTGCTTTTACATC^GTTCTTAAACGTGC(2ATCAAA 
ACAACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGA 
AAAATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAA 
ATAAAGCAGAAG CAG CTGAACAATTTGGTG ATGAG CAAGTT CATATTTGG 
CGTCGTT CATATGATGT ATTG CCTC CAGATATGG CTAAAG ATGATGAACA 
TTC^GC^CATACTCATCGTCGCTATGCTTCACTAGATGATTCTGTTATTC 
CAGATGCAG AAAAC CTAAAAGTTACTTTAG AG CGTG CTCTT CCTTTCTGG 
GAAG ATAAAATTG CT CCTG CT CTT AAAGAT GGTAAAAATGTGTTTGTTG G 
TG CACACGGT AACTCAAT C CGTG CTCTTGTAAAACATAT CAAACAATTGT 
CAGATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTT 
TTCGAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAA 
A 

SEQ ID NO. 4205: 18RS21 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG 

AATAAAG CTAAC CTTTT CACTGGATGG GCT GACGTAG AT CTTT CAG AAAA 

AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA 

TTGAGTTCGACCTTGCTTTTACATC^GTTCTTAAACGTGCCATCAAAACA 

ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA 

ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA 

AAGC^GAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT 

CGTTCAT ATG ATGTATTG CCTC CAG AT ATG G CT AAAGATGATGAACATT C 

AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG 

ATG CAGAAAACCTAAAAGTTACTTTAGAG CGTGCT CTT CCTTTCTGGGAA 

GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC 

ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG 

ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC 

G AATTTGATGAAAAATTAAAC CTTGTT TCAGAATATT ACTT AGGTAAA 

SEQ ID NO. 4206: M732 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG 

AATAAAG CTAAC CTTTT CACTGG ATG G G CTG ACGT AG AT CTTTCAGAAAA 

AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA 
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TTGAGTT CGACCTTG CTTTTACAT CAGTT CTTAAACGTG C CAT CAAAACA 
ACT AAC CTTGC C CTTGAAG CAG CT GAT CAA CTTT GGGTAC CAGTT GAAAA 
ATCATGG CG CTTG AACGAACGT CATTACGGTGGATTG ACAGG AAAAAAT A 
AAG CAGAAG CAG CTG AACAATTTGGTGAT GAG CAAGTTCATATTTGG CGT 
CGTT CAT ATGATGTATTGCCTCCAGATATGG CTAAAG ATG ATGAACATT C 
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCrGTTATTCCAG 
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA 
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC 
ACACGGTAACT CAAT CCGTG CT CTT GTAAAACAT AT CAAACAATTGT CAG 
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC 
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA 

SEQ ID NO. 4207: COH1 STRAIN 

GTAAAATTAGTATTCGCACGCCACGG 

TGAATCTGAGTGGAATAAAGCTAACCTTTTCACTGGATC 

ATCTTTCAGAAAAAGGTACACAACAAG CT ATTGATGCTG GGAAATTAATT 

CAAG CAGCAGGT ATTGAGTT CG AC CTTG CTTTTACAT CAGTT CTTAAACG 

TGCCATCAAAACAACTAACCTTGCCCTTGAAGCAGCT^ 

TACCAGTTG AAAAAT CATGG CG CTTGAACG AACGT CATTACGGTGGATTG 

ACAGGAAAAAAT AAAG CAGAAG CAG CTGAACAATTTGGTG ATGAGCAAGT 

TCATATTTGGCGTCGTTCATATGATGTATTGCCTCCAGATATGGCTAAAG 

ATGATGAACATTCAGCACATACTGATCGTCGCTATGCTTCACTAGATGAT 

T CTGTTATTC CAGATG CAG AAAAC CTAAAAGTTACTTTAGAGCGTGCTCT 

TCCTTT CTGGGAAGATAAAATTG CT CCTG CT CTTAAAGATGGTAAAAATG 

TGTTTGTTGGTGCACACGGTAACTCAATCCGTGCTCTTGTAAAACATATC 

AAACAATTGT CAGATGATGAAAT CATGG ACGTTGAAATT C CTAACTT CC C 

ACCACTTGTTTTCXIAATTTGATGAAAAATTAAACCT^ 

ACTTAGGTAAA 

SEQ ID NO. 4208: CJB110 STRAIN 

GTAAAATTAGTATTCGCACGCCACGG 

TGAATCTGAGTGGAATAA^GCTAACCTTTTCACTGGATGGGCTGACGTAG 
AT CTTTCAG AAAAAGGT ACACAACAAG CT ATTGATGCTG GGAAATTAATT 
CAAGCAGCAGGTATTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACG 
TGCCATCAAAACAACTAACCTTGCCCTTGAAGCAGCTGATGAACTTTGGG 
TACCAGTTGAAAAATCATGGCGCTTGAACGA^CGTCATTACGGTGGATTG 
ACAGGAAAAAATAAAGCAGAAGCAG CTGAACAATTTGGT G ATGAG CAAGT 
T CATATTTG G CGTCGTT CAT ATGATGTATTGC CT C CAGAT ATGG CTAAAG 
ATGATGAAC^TTC^GCACaTACTGATCGTCGCTATGCrTCACTAGATGAT 
TCTGTTATT CCAGATG CAGAAAAC CTAAAAGTTACTTTAG AG CGTGCTCT 
TCCTTTCTGGGAAGATAAAATTGCTCCTGCTCTTAAAGATGGTAaAAATG 
TGTTTGTTGGTGCACACGGTAACTCAATCCGTGCTCTTGTAAAACATATC 
AAACAATTGTCAGAT GATGAAATCATGGACGTTG AAATT C CTAACTT CC C 
AC CACTTGTTTT CGAATTTG ATGAAAAATTAAAC CTTGTTTCAGAAT ATT 
ACTTAGGTAAA 

SEQ ID NO. 4209: 1169NT STRAIN 

AGTATTCGCACGCCACGGTGAATCTGAGTGGAATAAAGCTAACCTTTTCA 
CTGG ATGGG CTG ACGTAGAT CTTT CAGAAAAAGGT ACACAACAAG CTATT 
GATG CIGGG AAATT AATTCAAG CAG CAGGTATTGAGTTCG AC CTTG CTTT 
TACATCAGTTCTTAAACGTGCCATCAAAACAACTAACCTTGCCCTTGAAG 
CAG CTGATCAACTTTGGGTACCAGTTGAAAAAT CATG G CG CTTGAACGAA 
CGTCATTACG GTGG ATTG ACAGGAAAAAATAAAG CAG AAGCAGCTGAACA 
ATTTGGTGATGAGCAAGTTCATATTTGGCGTCGTTCATATGATGTATTGC 
CTC CAGATATGG CTAAAGATGATGAACATT CAGCACATACTGATCGTCG C 
TATGCTT(^CTAGATGATTCTGTTATTCCAGATGCAGAAAACCTAAAAGT 
TACTTTAGAG CGTGCT CTTC CTTT CTGGG AAGATAAAATTGCTCCTG CT C 
TTAAAGATGGTAAAAATGTGTTTGTTGGTGCACACGGTAACTCAATCCGT 
G CTCTTGTAAAACAT AT CAAACAATTGTCAGATGATG AAATCATGGACGT 
TGAAATTCCTA^CTTCCCACCACTTGTTTTCGAATTTaATGAAAAATT^ 
ACCTTGTTTCAGAATATTACTTAGGTAAA 

SEQ ID NO. 4210: M7 81 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGT 

GAAT CTG AGTGGAAT AAAG CT AACCTTTT CAC TG G ATGG G CTGAC GT AG A 
TCTTTCAGAAAAAGGTACACAACAAG CTATTG ATG CTGGGAAAT T AATT C 
AAGCAG CAGGTATT G AGTT CGAC CTTG CTTTTACATCAGTT CTT AAACGT 
GCCATCAAAACAACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGT 
ACCAGTTGAAAAATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGA 
CAGG AAAAAAT AAAG CAGAAGCAG CTG AACAATT TGGTG ATG AG CAAGTT 
CATATTTGG CGT CGTTCATATG ATGT ATTG CCTC CAG AT ATGG CTAAAG A 
TG ATGAACATT CAG CACAT ACT GAT CGTCG CT ATG CTT CACT AGATG ATT 
CTGTTATTCCAGATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTT 
CCTTTCTGGGAAGATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGT 
GTTTGTTGGTGCACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCA 
AACAATTGT CAG ATG ATGAAAT CATGG ACGTTGAAATT CCTAACTTC CCA 
CCACTTGTTTTCGAATTTGATGAAAAATTAAACCTTGTTTCA 
CTTAGGTAAA 

SEQ ID NO. 4211: JH930013 STRAIN 

GTAAAATTAGTATTCGCACG CCACGGTGAATCT 

G AGTGGAAT AAAG C TAAC CTTTTCACTGG ATG GG CTG ACGTAGAT CTTT C 
AGAAAAAGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAG 
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CAGGT ATTG AGTT CG AC CTTG CTTTT ACAT CAGTT CTTAAACGTG CCATC 
AAAACAACTAACCTTGCCCTTGAAGCAGCTaA.TCMCTTTGGGTACCAGT 
TGAAAAATGATGGCGCTTGAACGAACX3TCATTACGGTGGATTGACAGGAA 
AAAATAAAG CAGAAG CAG CTGAACAATTTGGTGATGAG CAAGTT CATATT 
TGG CGT CGTT CATATGATGTATTG C CTC CAGATATGG CTAAAGATGATGA 
ACATT CAG CAC^TACTG AT CGTCX3 CT ATG CTTCACTAGATGATT CTGTTA 
TTCCAGATG CAG AAAAC CTAAAAGTTACTTTAGAG CGTG CTCTT CCTTT C 
TGGGAAG AT AAAATTGCT C CTG CT CITAAAGATGGTAAAAATGTGTTTGT 
TGGTGCACACGGTAACTCAATCCGTGCTCTTGTAAAACA.TATC^ 
TGTCAGATGATGAAATC7VTGGACGTTGAAATTCCTAACTTCCCACCACTT 
GTTTTCGAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGG 
TAAA 



PRETTY of: /biotmp/msa63264 .2{*} March 10, 2003 09:30 



msa63264 . 2 { 110_090 } 
msa63264 . 2 { 110_1169NT} 
msa63264 . 2 { 110_18RS21 } 
msa63264 .2{110_2603) 
msa63264 . 2 { 110_CJB110 } 
msa63264 . 2 { 110_COHl} 
msa63264 . 2 { 110_H36B} 
msa63264.2{!10_JM9130013} 
msa63264 . 2 { 110_M732 } 
msa63264.2{ll0_M78l} 
rasa63264.2{l!0_A909} 
Consensus 



gtaaaat 

gtaaaat 

atggtaaaat 
~ — gtaaaat 

gtaaaat 

gtaaaat 

gtaaaat 

gtaaaat 

gtaaaat 

gtaaaat 



t AGTATTCGC 
-AGTATTCGC 
t AGTATTCGC 
tAGTATTCGC 
t AGTATTCGC 
tAGTATTCGC 
tAGTATTCGC 
tAGTATTCGC 
tAGTATTCGC 
tAGTATTCGC 
tAGTATTCGC 
_********* 



ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
ACGCCACGGT 
********** 



GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT. 
GAATCTGAGT 
GAATCTGAGT 
GAATCTGAGT 
********** 



50 

GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
GGAATAAAGC 
********** 



msaS32S4 
msa63264 
msa63264 

msa63264 
msa63264.2{ 

msa63264 . 

msa63264 . 
msa63264»2{ll0 

msa63264 . 

msa63264. 

msa63264 . 



^-».2{ll0_090) 
.2{llO_1169NT} 
.2{110_18RS21} 
2{110_2603} 
110_CJB110} 
2{ll0_COHl} 
2{110_H36B} 
_JM9130013) 
2{110_M732) 
2{110_M781} 
110_A909} 



2{ 



51 

TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 
TAACCTTTTC 



ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 
ACTGGATGGG 



Consensus 



********** ********** 



CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
CTGACGTAGA 
********** 



TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
TCTTTCAGAA 
********** 



100 

AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
AAAGGTACAC 
********** 



msa63264 
msa63264 
msa63264 

msa63264 * 
msa63264.2{ 
msa63264 . 
msa63264 - 
msa63264.2{ll0 
msa63264 
msa63264 
msa63264 . 



^ .2{ll0_090} 
.2{110_1169NT} 
.2{110_JL8RS21) 
2{110_2603} 
110_CJB110} 
2{ll0_COHl} 
2{110_H36B} 
_JM9130013} 
2{110_M732} 
2{llO_M78lJ 
2{110_A909} 
Consensus 



101 

AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
AACAAGCTAT 
********** 



TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
TGATGCTGGG 
********** 



AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
AAATTAATTC 
********** 



AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
AAGCAGCAGG 
********** 



150 

TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
TATTGAGTTC 
********** 



msa63264 
msa63264 
msaS3264 

msa63264 - 
msa63264 . 2 { 

msa63264 . 

msa63264 . 
Tnsa63264.2{ll0 

msa63264 . 

msa63264 . 

msa63264 . 



, ^.2{ll0_090} 
.2{110_1169NT} 
.2{llO_18RS2i; 
2{110_2603 1 
110_CJB110* 
2{ll0_COHi: 
2{110_H36B 
_JM9130013] 
2{110_M732} 
2{110_M781} 
2{110_A909} 
Consensus 



151 

GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
GACCTTGCTT 
********** 



TTACAT CAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
TTACATCAGT 
********** 



TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
TCTTAAACGT 
********** 



GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
GCCATCAAAA 
********** 



200 

CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
CAACTAACCT 
********** 



m9a63264.2{ll0_090) 
msa63264.2{llO_1169NT} 
msa63264 . 2 { 110_18RS21 } 
msa63264 . 2 {110_2603 ) 
msa63264.2{ll0_CJB110} 
rasa63264 . 2 { 110_COH1 } 
msa63264.2{ll0 H36B) 
msa63264 . 2 {llO_JM913 00 13 } 
msa63264.2{llO_M732) 
msa63264 . 2 { 110_M7 81 } 



201 

TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 
TGCCCTTGAA 



GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 
GCAGCTGATC 



AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 
AACTTTGGGT 



ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 
ACCAGTTGAA 



250 

AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
AAATCATGGC 
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msa63264 .2{ll0_A909} TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC 
Consensus ********** ********** ********** ********** ********** 



msa63264 
msa63264 .2 
msa63264 .2 

msa63264 . 
msa63264 .2{ 

msa63264. 

msa63264 . 
msa63264.2{ll0 

msa63264 . 

msa63264 . 

msa63264 



.2{ll0_09Ol 
110_1169NT} 
110_18RS2l} 
2{110_2603} 
110_CJB110} 
2{llO > _COHl} 
2{110_H3 6B} 
JM9130013} 
2{110_M732} 
2{110_M781} 
2{110_A909} 
Consensus 



251 

GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTgAACGA 
GCTTaAACGA 



ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 
ACGTCATTAC 



GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 
GGTGGATTGA 



CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 
CAGGAAAAAA 



300 

TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 
TAAAGCAGAA 



****_***** ********** ********** ********** ********** 



msa63264 
msaS3264 »2{ 
msa63264.2{ 

msa63264 . 
msa63264.2{ 

msa63264 . 

msa63264. 
msa63264.2{H0. 

msa63264 . 

msa63264 . 

msa63264 . 



.2{ll0_090} 
110_1169NT) 
110_18RS2l} 
2{110_2603} 
110_CJB110} 
2{110__COH1) 
2{110_H3 6B} 
_JM9130013} 
2{110_M732} 
2{110_M781> 
2{llO_A909} 
Consensus 



301 

GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
GCAGCTGAAC 
********** 



AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
AATTTGGTGA 
********** 



TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
TGAGCAAGTT 
********** 



CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
CATATTTGGC 
********** 



350 

GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
GTCGTTCATA 
********** 



351 400 

msa63264.2{H0_090} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264.2{ll0_1169NT} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264 . 2 { 110_18RS2l} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msaS3264 . 2 { 110_2603 } TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264.2{ll0_CJB110} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264.2{ll0 COHl} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264 .2 {110~H36B} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264.2{ll0_JM9130013} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264 .2 {110_M732 } TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264 .2 {llO_M78l} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

msa63264 .2{110_A909} TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA 

Consensus ********** ********** ********** ********** ********** 

401 450 

msa63264.2{ll0 090) CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264.2{ll0_1169NT} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264 .2{110_18RS21} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264.2{H0_2603} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264.2{ll0_CJB110} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264.2{ll0_COHl} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264 .2{110_H36B} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa"63264.2{H0_JM9130013} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa632S4 .2{110_M732) CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

Ttisa63264 .2{llO_M781} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

msa63264.2{H0_A909} CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA 

Consensus ********** ********** ********** ********** ********** 



msa63264 
msa63264 .2{ 
msa63264.2{ 

msa63264 
msa63264.2{ 
msaG3264 . 
msa63264 . 
msa63264.2{ll0 
msa63264 
msa63264 
msa63264 . 



,2{ll0__090j 
110_1169NT} 
110_18RS2lJ 
2{110_2603} 
110_CJB110} 
2{110_COH1) 
2{110_H36B} 
_JM9130013} 
2(110_M732} 
2(110_M78lJ 
2{110_A909} 
Consensus 



451 

AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
AACCTAAAAG 
********** 



TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
TTACTTTAGA 
********** 



GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
GCGTGCTCTT 
********** 



CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
CCTTTCTGGG 
********** 



500 

AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
AAGATAAAAT 
********** 



501 550 

insa63264 ,2{110_090} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264 . 2 { 110_1169NT} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264 .2{110_18RS21} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264 . 2 {110_2603 } TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264.2{llO_CJB110} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264 ,2{110_COH1} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264 .2{110_H36B} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264.2{H0_JM9130013l TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 

msa63264.2{ll0_M732) TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 
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msa63264.2{ll0_M78l} TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 
msa63264 .2 (110_A909) TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA 
Consensus ********** ********** ********** ********** ********** 



msa63264 . 2 { 110__090 } 
msa63264 . 2 { 110_1169NT} 
msa63264 . 2 { 110__18RS21 } 
msa63264 . 2 {ll0_2603 } 
msa63264 . 2 { 110_CJB110 } 
msa63264 . 2 { 110_COH1 } 
msa63264.2{ll0_H3 6B} 
msa63264 . 2 { 110_JM91300 13 } 
msa63264.2{ll0_M732} 
msa63264.2{ll0_M78l} 
msa63264.2{ll0_A909} 
Consensus 



551 

ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
ACTCAATCCG 
********** 



TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
TGCTCTTGTA 
********** 



AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
AAACATATCA 
********** 



AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
AACAATTGTC 
********** 



600 

AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
AGATGATGAA 
********** 



msa63264 
msa63264 . 2{ 
msa63264.2{ 

msa63264 . 
msa63264 .2{ 

msa63264 

msa63264 . 
msa63264.2{ll0, 

msa63264 . 

msa63264. 

msa63264 . 



.2{110_090) 
110_1169NT} 
110_18RS2l} 
2{110_2603* 
110_CJB110l 
2{ll0_C0Hl) 
2{110_H36B] 
_JM9130013j 
2fll0_M732] 
2{110_M781] 
2{110_A909} 
Consensus 



601 

ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
ATCATGGACG 
********** 



TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
TTGAAATTCC 
********** 



TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
TAACTTCCCA 
********** 



CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
CCACTTGTTT 
********** 



650 

TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
TCGAATTTGA 
********** 



msa63264 .2{110_090} 
msa63264 . 2 { 110_1169NT} 
msa63264 . 2 { 110_18RS21 } 
msa63264.2{ll0_2603} 
nisa63264.2{ll0_CJB110} 
msa63264.2fll0_COHl} 
msa63264 . 2 {110_H36B} 
msa63264 . 2 { 110_JM9130013 } 
msa63264 . 2 { 110_M732 } 
msa63264 . 2 {llO_M78l} 
msa63264.2{H0_A909} 
Consensus 



651 

TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
TGAAAAATTA AACCTTGTTT 
********** ********** 



690 

CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
CAGAATATTA CTTAGGTAAA 
********** ********** 



SEQ ID NO. 4212: 2603 V/R STRAIN 

VIOjVPARHGESEWNKANLFTGWADVDIjSEKGTO^AIDAGKLIQAAGIEFDIiAFTSVLKRA 
IKTTNIJVLEAADQLWVPVEKSWRI^ERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE IMDVEI PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4213: 090 STRAIN 

VKLVFARHGESE WNKAltfLFTG WAD VDLSE KGTQQAI DAGKL I QAAGI EFDLAFTS VL KRA 
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRR 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE I MDVEI PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4214: A909 STRAIN 

VKLVFARHGE SEWNKANLFTGWAD VDLS E KGTQQAI DAGKL I QAAGI E FDLAFTS VLKRA 
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSY^ 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKH I KQLSDDE I MD VE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4215: H36B STRAIN 

VKLVFARHGESEWNKANLFTG WAD VDLS E KGTQQA I DAGKL I QAAG I EFDLAFTS VLKRA 
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP 
PDMAKDDEH S AHTDRR Y AS LDD S V I PDAENLKVTLERALPFWEDKI APALKDGKNVFVGA 
HGNS I RALVKH I KQLSDDE I MDVEI PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4216: 18RS21 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA 
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGK^ 

PDMAKDDEHSAHTDRRYASLDDSVI PDAENLKVTLERALPFWEDKI APALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE I MDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4217: M732 STRAIN 

VKLVFARHGESE WNKANLFTGWAD VDLS E KGTQQA I DAGKL I QAAG I EFDLAFTS VLKRA 
I KTTNLALEAADQLWVP VE KS WRLNERHYGGLTGKNKAEAAEQFGDEQVH I WRRS YD VL P 
PDMAKDDEHSAHTDRRYASLDDSVI PDAENLKVTLERALPFWEDKI APALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE IMDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4218: COHl STRAIN 
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VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLI QAAGI EFDLAFTSVLKRA 
IKTTNIJILEAADQLWVPVEKSWRLN^ 

PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPAL KDGKNVFVGA 
HGNS I RALVKHI KQLSDDE I MDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4219: CJB110 STRAIN 

VKLVFARHGESE WNKANLFTG WAD VDLSEKGTQQAI DAGKL I QAAG I EFDLAFTSVLKRA 
IKTTNIJUiEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIW 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENIiKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE I MDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4220: 1169NT STRAIN 

VFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSV^ 
TNLALEAADQLWPVEKSWRLNERHYGGLTGKN 

AKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGAHGN 
SI RALVKHI KQLSDDE IMDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4221: M781 STRAIN 

VKLVFARHGE SEWNKANLFTGWADVDLSEKGTQQAIDAGKL I QAAG I EFDLAFTSVLKRA 
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVH 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE IMDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

SEQ ID NO. 4222: JM9130013 STRAIN 

VKLVFARHGE SEWNKANLFTGWADVDLSEKGTQQAIDAGKL I QAAGI EFDLAFTSVLKRA 
I KTTNLALEAADQLWVP VEKS WRI^ERHYGGLTGKNKAEAAEQFGDEQVH I WRRS YDVLP 
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA 
HGNS I RALVKHI KQLSDDE IMDVE I PNFPPLVFEFDEKLNLVSEYYLGK 

PRETTY of : /biotmp/msa70722 . 2 { * } March 10, 2003 09:33 



msa70722 ,2{110_090} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722.2{ll0_18RS2l} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa7072 2. 2 {110_2603} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722 .2{ll0_A909} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722 . 2 { 110_CJB110 } vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722.2{llO_COHl} vklVFARHGE SEWNKANLFT GWADVDLSEK 

tnsa70722.2{ll0_H36B} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722.2{H0_OM9130013} vklVFARHGE SEWNKANLFT GWADVDLSEK 

tnsa70722 .2f 110_M732} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722.2{110_M78l} vklVFARHGE SEWNKANLFT GWADVDLSEK 

msa70722 . 2 { 110_1169NT] VFARHGE SEWNKANLFT GWADVDLSEK 

Consensus ******* ********** ********** 



GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 
GTQQAIDAGK 



50 

LI QAAGI EFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 
LIQAAGIEFD 



********** ********** 



51 

msa70722 . 2{ 110_090} LAFTSVLKRA IKTTNLALEA 

msa70722 .2{llO_18RS2l} LAFTSVLKRA IKTTNLALEA 

tnsa70722.2{llO_2S03} LAFTSVLKRA IKTTNLALEA 

tnsa70722 .2{110_A909) LAFTSVLKRA IKTTNLALEA 

msa70722.2{ll0_CJB110) LAFTSVLKRA IKTTNLALEA 

tnsa70722.2{ll0_COHl} LAFTSVLKRA IKTTNLALEA 

msa70722.2{H0_H36B> LAFTSVLKRA IKTTNLALEA 

msa70722 . 2 { 110JM9130013 ) LAFTSVLKRA IKTTNLALEA 

msa70722 . 2{llO_M732) LAFTSVLKRA IKTTNLALEA 

rasa70722.2{H0_M78l} LAFTSVLKRA IKTTNLALEA 

msa70722.2{ll0_ll(p9NT} LAFTSVLKRA IKTTNLALEA 

Consensus ********** ********** 



ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 
ADQLWVPVEK 



SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 
SWRLNERHYG 



********** ********** 



100 

GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
GLTGKNKAEA 
********** 



msa70722.2{ll0_090} 
msa70722.2{HO_JL8RS2l} 
msa70722.2{H0_2603} 
msa70722.2 {110_A909} 
msa70722 . 2 { 110_CJB110 ] 
msa70722 . 2 { 1 10_COH1 ] 
msa70722 .2{110_H36B) 
msa70722.2{H0_JM9130013j 
rasa70722 .2lll0_M732] 
msa70722.2{H0_M78lj 
msa70722 . 2 { 110_1169NT] 
Consensus 



101 

AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
AEQFGDEQVH 
********** 



I WRRS YDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
IWRRSYDVLP 
********** 



PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 
PDMAKDDEHS 



AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 
AHTDRRYASL 



150 

DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 
DDSVIPDAEN 



********** ********** ********** 



151 200 

msa70722 .2{110_090} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722 -2{110_18RS21} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

m sa70722.2{ll0_2603} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722 .2{110_A909} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722 .2{ll0_CJB110} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722.2{llO_COHl} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722.2{HO_H36B} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722.2{ll0_JM9130013} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 

msa70722.2{110_M732} LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HI KQLSDDE I 
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msa70722 . 2 { 110_M781 ) 
msa70722.2{ll0_1169NT} 
Consensus 



msa70722 . 2 { 110_0 90 } 
msa70722 . 2 { 110_18RS21 } 
msa70722.2{ll0_2603} 
msa70722.2{H0_A909 
msa70722 . 2 { 110_CJB110 } 
ms a7 0 7 2 2 . 2 { 1 1 0_COH1 } 
msa70722.2{llO_H36B} 
msa70722 . 2 {110_JM9130013 } 
msa70722.2{H0_M732} 
msa70722.2{llO__M78l} 
msa70722.2{H0_1169NT} 
Consensus 



LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI 
LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI 
********** ********** ********** ********** ********** 



201 

MDVEIPNFPP 
MDVEIPNFPP 
MOVE I PNFPP 
MDVEIPNFPP 
MDVEIPNFPP 
MDVEIPNFPP 
MDVEIPNFPP 
MDVEIPNFPP 
MDVETPNFPP 
MDVEIPNFPP 
MDVEIPNFPP 
********** 



LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
LVFEFDEKLN 
********** 



229 

LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
LVSEYYLGK 
********* 
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SEQ ID NO. 4301: 2603 V/R STRAIN 

ATGAATCTTTTAATTATGGGTTTGCCTG 

GTTGAAGAATTTGGTGTTGCrrCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCT 
AAT CAAAC CG AAATGGGACGTTT AG CTAAAAGTTAT ATTGATAAAG GTGAATT GGTTC CT 
GATGAAGTAACAAACGGGATTGTAAAAGAGCGCrTAGCTGAGGATGATATCGCAGAAAAA 
GGTTTTTTACTTGATGGATATCCACGTACTATTGA^ 

CTTGAAGAACT AGGACTACG CTT AG ATGGTGTTATTAATATTAAAGTGGAT CCAT CATGT 
CTTATAGAGCGTTTGAGTGkTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAA 
GTGTT CAACC CAC CAGTAGATT ATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAG 
C CTGAAACTGT CAAACGT CG CTTGGACGTT AAT ATTGCT CAAGGAGAACCT ATTCTTG AA 
CACTAT CGTAAG CTT GGT CTTGT TACAGATATTGAAGGTAAT CAAGAAATAACAGAAGTT 
TTTGCAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 4302s 090 STRAIN (reverse complement) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCA 

AGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCAC^TCTCAACAGGGGATATGTTCCG' 
CG C CG CAATGG CTAAT CAAAC CGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGG 
TGAATTGGTT C CTGATGAAGT AA.CAAACGGGATTGT AAAAGAG CG CTTAG CTGAGGATGA 
TAT CG CAGAAAAAGGTTTTTTACTTGATGGATAT C CACGTACTATTGAACAAG CACACG C 
CTTAGATG CTACG CTTGAAGAACTAGGACTACG CTT AG ATGGTGT T ATTAATATTAAAGT 
GGATCCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACT 
AACITTCCACAAAGTGTT CAAC CCAC CAGTAGATTATAAAGAAGAAGATTACTAT CAACG 
TGAAGATGATAAGCCTGAAACTGTCAAACGTCGCT 

AATAACAGAAGTTTTTGCAGATGTTGAAAAAGCGTTG 

SEQ ID NO. 4303: 3.169NT STRAIN (REVERSE COMPLEMENT) 

TGGTAAAGGGACTCAAGCAGCTAAGATTGTTGAAGAATTTGGTG 

AGGGGATATGTTC CG CGCCGCAATGGCTAAT CAAACCGAAATGGGACGTTTAG CTAAAAG 
TTATATTGATAAAGGTGAATTGGTTCCTGATCAAGTAACAAAC 

CTTAG CTGAGGATGATATCX3 CAG AAAAAGGTTTTTT ACTTGATGGGTATCCACGTAOT 

TGAACAAG CAC ACG C CTT AGATG CTACX3 CTTGAAGAACTAGGACTACG CTTAG ATGGTGT 

TATTAATATTAAAGTGGATCCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATC^A 

T CGTAAAACTGGTGAAACTTTCCACAAAGTGTT CAACCCACCAGTAGATTATAAAGAAGA 

AGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCG 

TATTGCTCAAGGAGAACCTATTCTTGAAC^CTATAGTAAGCT^ 

TGAAGGTAATCAAGAAATAA 

SEQ ID NO. 4304: 18RS21 STRAIN (REVERSE COMPLEMENT) 

AAT CTTTT AAC CACGGGTTCG CCTGGTG CTGGTAAAGGTACTCAAG CAGCTAAGATCG 
TTGAAGAATTTGGTGTTGCTCACATCTCAACAGGGGATAT^ C CG CGCCG CAATGG CTA 
ATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAA^ 

ATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAG CTGAGGATGATATCG CAGAAAAAG 
GTTTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACA CTTAGATG CTACG C 
TTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTC 
TTATAGAG CGTTTGAGTGGT CGTATTAT C^TCGT AAAACTGGTGAAACTTTCCACAAAG 
TGTTC^^CCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGC 
CTGAAACTGTCAAACGTCG CTTGGACGTTAATATTG CT CAAGGAGAAC CTATT CTTGAAC 
ACTATCGTAAGCIT'GGTCTTGTTACAGATATTGAAGGTAATCAAGAA^ 
TTGCAGATGTTGAAAAAGCGTTG 

SEQ ID NO. 43 05: A909 STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTG C CTGGTG CTGGTAAAGGTACT CAAG CAG 
CTAAGATCGTTGJy^G A^TTTGGTGTTGCTCACATCT CAACAGGGGATATGTT C CG CG C CG 
CAATGG CTAAT CAAAC CGAAATGGGACGTTTAG CTAAAAGTTATATTGATAAAGGTGAAT 
TGGTTC CT GATGAAGT AACAAACGGGATTGTAAAAGAG CG CTTAG CTGAGGAT GATATCG 
CAGAAAAAGGTTTTTTACTTGATGGATAT C CACGTACTATTGAACAAGCACACGC CTTAG 
ATG CTACG CTTGAAGAACTAGGACT ACG CTT AGATGGTGTTATTAATATTAAAGTGGAT C 
CATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTT 
TCCACAAAGTGTT CAAC C CACCAGTAGATT ATAAA.GAAGAAGATTACTATCAACGTGAAG 
ATG ATAAG CCTGAAACTGTCAAACGT CG CTTGGACGTT AATATTG CTCAAGGAGAATCTA 
TTCITGAACACTATCGAAAGCTTGGTCTTGTTAC^ 

SEQ ID NO. 4306: CJB110 STRAIN (REVERSE COMPLEMENT) 

AAT CTTTTAACCACGGGTTTG CTTGGTGCTGGTAAAGGTACT CAGCTAA 

G ATCGTTGAAG AATTTGGTGTTGCT CACAT CTCAACAGGGGATATGTTCCG CG CCG CAAT 

GGCTAATCAAACCGAAATC<^ACGTTTAGCTAAAAGTTATATTaATAAAGGTGAATTGGT 

TCCTGATGAAGT AACAAACGGGATTGTAAAAGAG CG CTTAG CTGAGGATGATATCGCAGA 

AAAAGGTTTTTTACTTGATGGATAT C CACGTACTATTGAACAAGCACACG C CTTAGATGC 

TACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGG 

ATGTCTTATAGAGCGTTTGAGTGGT CGTATTAT CAAT CGTAAAACTGGTG AAACTTT C CA 

CAAAGTGTT CAAC C CACCAGTAG ATTAT AAAGAAG AAGATTACTATCAACGTGAAGATGA 

TAAGC CTGAAACTGT CAAACGTCGCTTGGAC GTT AATATTGCT CAAGGAGAACCTATT CT 

TGAACACTATAG 

SEQ ID NO. 4307: COH1 STRAIN (REVERSE COMPLEMENT ) 

AT CTTTTAATTATGGGTTTG CCTGGTG CTGGTAAAGGTACT CAAG CAG CTAAGATTGTTG 
AAG AATTTGGTGTTGCTCACATCTCAACAGGGG ATATGTT C CG CGC CG CAATGG CTAATC 
AAAC CCAAATGGGACGTTTAG CTAAAAGTTATATTGATAAAGGTGAATTGGTT CCTGATG 
AAGTAACAAACGGGATTGTAAAAGAG CG CTTAGCTGAGGATGATAT CGCAGAAAAAGGTT 
TTTTACTTGATGGATATCCACGTACTATTGAGCA^ 

AAG AACTAGGACTACG CTTAG ATGGTGTTATTAAT ATT AAAGTGGATC CAACATG CCT 
TAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACITTCCa 
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T CAAC C CAC CAGTAG ATT AT AAAGAAGAAGATT ACTAT CAACGTG AAGATGATAAG CCTG 
AAACT GTCAAACGTCG CTTGG ACGTT AATATTG CT CAAGGAGAAC CTATT CTTGAACACT 

CAGATGTTGAAAAAGCGTTG 

SEQ ID HO. 4308: H3 6B STRAIN (REVERSE COMPLEMENT) 

CAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAA 
GTTATATTGATAAAGGTGAATTGGTTCCTGATGAAGTAACAAACGGGATTGTAAAAGAGC 
GCTTAG CTGAGGATG ATAT CG CAGAAAAAGGTTTTTTACTTGATGG ATAT C CACGT ACTA 
TTGAACAAG CACACG C GITAGAT GCTACGCTTG AAG AACTAGGACTACG CTTAGATGGTG 
TTATTAAT ATTAAAGTGGAT C CATCATGTCTTATAGAG CGTTTGAGTG GT C GT ATTAT CA 
AT CGTAAAAC TGGTGAAACTTTC CACAAAGT GT TCAAC CCAC CAGTAG ATTATAAAGAAG 
AAGATT ACTAT CAACGTGAAGATGATAAG C CTGAAACTGT CAAACGT CGCTTGGACGTTA 
ATATTG CT CAAGGAGAAT CTATT CTTGAACACTAT CGTAAG CTTGGT CTTGTTACAGATA 
TTGAAGGTAATC^GAAATAACAGAAGTTTTTG CAGATGTTGAAAAAGCGTTG 

SEQ ID NO. 4309: OM9130013 STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTG CTGGTAA^GGT 

ACT CAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCT CACAT CT CAACAGGGG ATATG 
TTCCGCGC CG CAATGG CTAAT CAAAC CGAAATGGGACGTTTAG CTAAAAGTTATATTGAT 
AAAGGTGAATTGGTTCCTGATGAAGTAACAAACGGGATTGTAAA^ 

G ATGATAT CG CAGAAAAAGGTTTTTT ACTTGAT GGATAT C CACGTACTATT GAACAAG CA 
CACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTT^ 

AAAGTGGATCCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCX5TAAAACT 
GGTGAAACTTT C CACAAAGTGTT CAACC CAC CAGTAGATTATAAAGAAGAAGATT ACTAT 
CAACGTGAAGATGATAAGCCTGAAACTGTTAAACGTCGCTTGGAC^ 

GGAGAACCTATT Cl'l'G AACACTATAAAAAG CTTGGTCTTGTTACAGATATTGAAGGTAAT 
CA 

SEQ ID NO. 4310: M732 STRAIN (REVERSE COMPLEMENT) 

CTTTTAATTATGGGTTTG CCTGGTG CTGGTAAAGGTACTCAAG CAGCTAAGATTGTTGAA 
GAATTTGGTGTTGCTCACATCTCAACAGCiGGAT^ 

AC C CAAATGGGACGTTTAGCT AAAAGTTATATTGATAAAGGTGAATTGGTT CCT 
GTAACAAACGGGATTGTAAA^GAGCG CTTAG CTGAGGATGATATCG CAGAAAAAGGTTTT 
TTACTTGATGGATAT CCACGTACTATTGAG CAAGCACACGCCTTAGATGCTACG CTTGAA 
GAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTG 
GAGCGTTTGAGTGGCCGTATTATCAATCGTAAA^CTGGTC 

AACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCrrc 
ACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAA 
CGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCA^ 
G ATGTTGAAAAAG CGTTG 

SEQ ID NO. 4311: M781 STRAIN (REVERSE COMPLEMENT) 

AAT CITTTAATTACGGGTTTG CCIGGTGCTO 

G CAGCTAAGATTGTTG AAGAATTTGGTGTTG CT CACAT CTCAACAGGGGATATGTT CCG C 
GCCGCAATGGCTAATCAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGT 
GAATTGGTTCCTGATGAAGTAAGAAACGGGATTGTAAAAG 

AT CGCAGAAAAAGGTTTTTT ACTTGATGGATATCCACX5TACT CAAGCACACG CC 

TTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATC 

GAT CCAACATG CCTTATAGAG CGTTTGAGTGGCCGT ATTAT C^VATCGTAAAACTGGTGAA 
ACTTT C CACAAAGTGTTCAACC CACCA.GTAG ATTAT AAAGAAGAAGATTACTATCAA 
GAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACG 

MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa25038 . 2 { * } April 17, 2002 08:53 
PRETTY of : /biotmp/msa252229 .2{*} January 31, 2003 03:05 .. 

1 50 

msa252229.2{ll4_COHl} atcttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229.2{114_M732} -cttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229.2{ll4_M78l} Aatcttt taattacggg tttgcctggt gctggtaaag gtactcaagc 

msa252229 .2 {114_A909} Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229.2{ll4_JM9130013} Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229.2{ll4_CJB110 J Aatcttt taaccacggg tttgcttggt gctggtaaag gtactcaagc 

msa252229 .2{ll4_090> Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229 . 2 { 114_2603 } atgAatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc 

msa252229.2{114_H36B} 

msa252229.2f 114_18RS2l} Aatcttt taaccacggg ttcgcctggt gctggtaaag gtactcaagc 

msa252229.2{114_1169NT} — tggtaaag ggactcaagc 

Consensus **** 



msa252229 . 2 \ 114_COHl} 
msa252229 . 2 (114_M732 } 
msa252229.2{H4_M78l} 
msa252229 . 2 { 114_A909) 
msa252229.2{H4_JM9130013} 
msa252229 . 2 { 114_CJB110 } 
msa252229.2{H4_090} 
msa252229 . 2 { 114_2603 } 
msa252229 . 2 {114_H36B} 
msa252229 . 2 f 114_18RS2l} 
msa252229 - 2 { 114_1169NT} 



51 

agctaagatt 
agctaagatt 
agctaagatt 
agctaagatc 
age t a agate 
agctaagatc 
agctaagatc 
agctaagatc 



gttgaagaat 
gttgaagaat 
gttgaagaat 
gttgaagaat 
gttgaagaat 
gttgaagaat 
gttgaagaat 
gttgaagaat 



ttggtgttgc 
ttggtgttgc 
ttggtgttgc 
ttggtgttgc 
ttggtgttgc 
ttggtgttgc 
ttggtgttgc 
ttggtgttgc 



tcacatctca 
tcacatctca 
tcacatctca 
tcacatctca 
tcacatctca 
tcacatctca 
tcacatctca 
tcacatctca 



agctaagatc gttgaagaat 
agctaagatt gttgaagaat 



ttggtgttgc 
ttggtgttgc 



tcacatctca 
gcacatctca 



100 

aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
-CAGGGGATA 
aCAGGGGATA 
aCAGGGGATA 
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Consensus 



_********* 



msa252229. 

msa252229. 

msa252229 . 

msa252229. 
msa252229.2{ll4 
msa252229.2{ 
msa252229 

msa252229 

msa252229. 
msa252229 
msa252229 



J.aj 

J.2{ 



2{114_C0H1} 
2(114__M732) 
2{114_M781} 
2{ll4_A909} 
_JM9130013} 
114_CJB110} 
.2{114_090} 
2{114_2603} 
2{114_H36B} 
114_18RS2l} 
114_1169NT} 
Consensus 



101 

TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
TGTTCCGCGC 
********** 



CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
CGCAATGGCT 
********** 



AATCAAACCc 
AATCAAACCc 
AATCAAACCc 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
AATCAAACCg 
*********_ 



AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
AAATGGGACG 
********** 



150 

TTT AG CT AAA 
TTT AG CT AAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
TTTAGCTAAA 
********** 



msa252229. 

msa252229. 

msa252229. 

msa252229. 
msa252229.2{H4 
msa252229 .2{ 
msa252229 

msa252 22 9 

msa252229- 
msa252229.2{ 
msa252229.2{ 



2{114_C0H1} 
2{114_M732} 
2lll4_M78l} 
2{114_A909} 
JM9130013} 
114_CJB110' 
,2{114_090 
2{114__2603' 
2{ll4_H36B] 
114_18RS2lJ 
114_1169NT) 
Consensus 



151 

AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 
AGTTATATTG 



ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 
ATAAAGGTGA 



********** ********** 



ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
ATTGGTTCCT 
********** 



GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATgAAGTAA 
GATcAAGTAA 
***_****** 



200 

CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
CAAACGGGAT 
********** 



201 250 

msa252229.2{H4_COHl} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229.2{114_M732} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229.2{ll4_M78l} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229.2{H4_A909} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229-2{H4_JM9130013} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229.2{ll4_CJB110} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGT TTT TTAC 

msa252229 .2{114_090} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229 .2 f 114_2603 } TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

msa252229.2{ll4_H36B) TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

rasa252229.2{H4_18RS21} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

rasa252229.2{H4_1169NT} TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC 

Consensus ********** ********** ********** ********** ********** 

251 300 

msa252229.2lll4_COHl} TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG 

msa252229 . 2 { 114_M732 } TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG 

msa252229 .2{ll4_M78l} TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG 

msa252229.2{H4_A909} TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa252229.2{ll4_JM9130013} TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

rasa25222 9.2{ll4_CJB110j TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa2 52229 . 2{114_090} TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa252229 .2(114_2603} TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa252229.2{l 14_H3 6B } TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa252229 . 2 { 114_18RS2l} TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

msa252229.2{ll4_1169NT} TTGATGGgTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG 

Consensus *******-** ********** *****-**** ********** ********** 

301 350 

msa252229 .2f 114_C0Hl} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229 .2{114_M732} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2(ll4_M78l} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2(114_A909} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{ll4_JM9130013} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{ll4_CJB110} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{H4_090} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229 .2{ll4_2603} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{H4_H36B} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{l!4_18RS2l} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

msa252229.2{H4_1169NT} CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA 

Consensus ********** ********** ********** ********** ********** 



351 400 

msa25222 9.2{H4_COHl> TCCAaCATGc CTTATAGAGC GTTTGAGTGg CCGTATTATC AATCGTAAAA 

msa25222 9.2{H4_M732} TCCAaCATGc CTTATAGAGC GTTTGAGTGg CCGTATTATC AATCGTAAAA 

msa252229-2{ll4_M78l} TCCAaCATGc CTTATAGAGC GTTTGAGTGg CCGTATTATC AATCGTAAAA 

msa252229.2{H4_A909} TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 

msa252229.2{114_JM9130013> TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 

msa252229.2{ll4_CJB110} TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 

msa252229 . 2{ 114_090) TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 

msa252229.2{H4_2603} TCCAtCATGt CTTATAGAGC GTTTGAGTGk tCGTATTATC AATCGTAAAA 

msa252229.2{H4_H3 6B> TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 

msa252229 .2 {l!4_18RS2l} TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA 
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msa252229.2{ll4_1169NT} TCCAtCATGt CTTATAGAGC GTTTGAGTGg t CGTATTATC AATCGTAAAA 
Consensus ****-****_ ********** *********_ _********* ********** 



msa252229 . 

rasa252229. 

msa252229. 

msa252229. 
nisa252229.2{ll4 
msa252229.2{' 
msa252229 

msa252229 

msa252229 
msa252229 
msa252229 



2{ll4_COHl) 
2{114_M732} 
2{114_M781} 
2{ll4_A909} 
_JM9130013) 
114_CJB110) 
,2{ll4_090} 
2{114_2603} 
^.2{ll4_H36B} 
.2{ll4_18RS2l} 
.2{114_1169NT} 
Consensus 



401 

CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
CTGGTGAAAC 
********** 



TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
TTTCCACAAA 
********** 



GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
GTGTTCAACC 
********** 



CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
CACCAGTAGA 
********** 



450 

TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
TTATAAAGAA 
********** 



msa252229. 
msa252229. 
msa252229 
msa252229 
msa252229.2{ll4 
msa252229.2{ 
msa252229 
msa252229 
msa252229 
msa252229.2( 
msa252229.2{ 



2{ll4_COHl} 
2(ll4_M732* 
2(114_M78lj 
2{114_A909] 
JM9130013 
114_CJB110' 
2 { 114^090 ; 
2{114_2603; 
2{ll4_H36Bj 
114_18RS2l) 
114JL169NT} 
Consensus 



451 

GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
GAAGATTACT 
********** 



ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
ATCAACGTGA 
********** 



AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
AGATGATAAG 
********** 



CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
CCTGAAACTG 
********** 



500 

TcAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TtAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TcAAACGTCG 
TCAAACGTCG 
*_******** 



msa252229. 

msa252229 . 

msa252229. 

msa252229 
msa252229.2{ll4. 
msa252229.2{ 
msa252229 

msa252229 . 

msa252229. 
msa25222 9.2{ 
msa25222 9.2{ 



2{ll4_COHl} 
2{114_M732} 
2{114_M781} 
2 {114_A909} 
_JM913 0013> 
114_CJB110} 
.2{ll4_090> 
2{114_2603) 
2{114_H36B} 
114__18RS2l} 
114_1169NT} 
Consensus 



501 

CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
CTTGGACGTT 
********** 



aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
aATATTGCTC 
CATATTGCTC 



_********* **_ 



AAggagaacc 
AAggagaacc 

AA — 

AAggagaatc 
AAggagaacc 
AAggagaacc 
AAggagaacc 
AAggagaacc 
AAggagaatc 
AAggagaacc 
AAggagaacc 



550 

tattcttgaa cactatcgta 
tattcttgaa cactatcgta 



tattcttgaa 
tattcttgaa 
tattcttgaa 
tattcttgaa 
tattcttgaa 
tattcttgaa 
tattcttgaa 
tattcttgaa 



cactatcgaa 
cactataaaa 
cactatag — 
cactatcgta 
cactatcgta 
cactatcgta 
cactatcgta 
cactatagta 



msa25222 9 . 2 f 114_COHl } 
msa252229.2{114_M732} 
msa252229.2fll4_M78l} 
msa252229.2{114_A909} 
msa252229.2{H4_JM9130013} 
msa252229.2{H4_CJB110} 
msa252229.2{H4_090} 
msa252229 . 2 {114__2603 } 
msa252229 . 2 { 114_H3 6B} 
msa252229 . 2 { 114_18RS21 } 
msa252229 . 2 { 114_1169NT} 
Consensus 



551 600 
agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 
agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 



agcttggtct tgttacagat attgaaggta a — ~~ 
agcttggtct tgttacagat attgaaggta atca- 



agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 

agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 

agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 

agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt 

agcttggcct tgttacagat attgaaggta atcaagaaat aa 



msa252229 . 2 { 114_C0H1 } 
msa252229 . 2 { 114_M732 } 
msa252 22 9 . 2 { 114_M7 8 1 } 
msa252 2 2 9 . 2 { 114_A9 0 9 } 
msa252229.2{H4_JM9130013} 
msa252229.2{H4_CJB110) 
msa2 5 22 2 9 . 2 { 114__0 9 0 } 
msa25222 9 . 2 ( 114_2603 } 
msa25222 9 . 2 { 114_H36B} 
msa252229 . 2 f 114_18RS2X } 
msa252229 . 2 { 114 JL169NT} 
Consensus 



601 

tttgcagatg ttgaaaaagc gttg- 
tttgcagatg ttgaaaaagc gttg- 



636 



tttgcagatg ttgaaaaagc gttg 

tttgcagatg ttgaaaaagc gttgctagaa ctcaaa 

tttgcagatg ttgaaaaagc gttg — 

tttgcagatg ttgaaaaagc gttg 



_****** ****** 



SEQ ID NO. 4312: 2603 V/R STRAIN 

MNLLI MGLPGAGKGTQAAKI VEE FGVAH I STGDMFRAAMANCTEMGRLAKS Y IDKGELiVP 
DEVTNGIVKER1AEDDIAEKGFLLIX?YPRTIEQAHAIJDATLEE]^LRLDGVINIKVDPSC 
LIERLSXRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLiDVNI AQGEPI LE 
HYRKLGLiVTD I EGNQE ITEVFADVEKALLELK 



SEQ ID NO. 4313: 090 STRAIN 

NLLI MGLPGAGKGTQAAKI VEEFGVAHISTGDMFRAAMANQTEMGRIAKSY I DKGELVPD 
EVTNGIVKERIJ^DDIAEKGFLLBGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERLSGRI INRKTGETFT1KVFNPPVDYKEEDYYQREDDKPETVKRRLDWIAC<3EPILEH 
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YRKLGLVTD I EGNQE I TE VFADVEKALLEL K 
SEQ ID NO. 4314: 116 9NT STRAIN 

GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPDQVTNGIVKER 
LAEDD I AEKGPLLDGYPRTI EQAHALDATLEELGLRLDGVI NI KVDPSCL I ERLSGRI IN 
RKTGETPTIKVFNPPVDYKEEDYYQREDDKPETVKRRLDVHIAO^EPILEHYSKLGXiVTDI 
EGNQE I 

SEQ ID NO. 4315: 18RS21 STRAIN 

NLLTTGSPGAGKGTQAAKI VEE FGVAH I STGDM FRAAMANQTEMGRLAKS Y I DKGELVPD 
EVTNG I VKERLAEDDI AEKGFLLDGYPRT I EQAHALDATLEELGLRLDGVINI KVDPSCL 
I ERLSGRI I NRKTGETFHKVFNPPVD YKEED YYQREDDKPETVKRRLD W I AQGEP I LEH 
YRKLGLVTD I EGNQE I TE VFADVEKALLE 

SEQ ID NO. 4316: A909 STRAIN 

NLL I MGLPGAGKGTQAAKI VEEFGVAH I STGDMFRAAMANQTEMGRLAKSY I DKGELVPD 
EVTNGI VKERLAEDDI AEKGFLLDGYPRT I EQAHALDATLEELGLRLDGVINI KVDPSCL 
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH 
YRKLGLVTDI EG 

SEQ ID NO. 4317: A909 STRAIN 

NLL IMGLPGAGKGTQAAKI VEEFGVAH I STGDMFRAAMANQTEMGRLAKSY I DKGELVPD 
EVTNGI VKERLAEDDIAEKGFLLDGYPRTI EQAHALDATLEELGLRLDGVINI KVDPSCL 
I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGES I LEH 
YRKLGLVTDI EG 

SEQ ID NO. 4318: CJB110 STRAIN 

NLLTTGLLGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTEMGRLAKSY I DKGELVPD 
EVTNGIVKERIAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRIiDGVINIKVDPSC^ 
I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH 
Y 

SEQ ID NO. 4319: COH1 STRAIN 

LL I MGLPGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTQMGRLAKSYIDKGELVPDE 
VTNG I VKERLAEDD I AEKGFLLDGYPRT I EQAHALDATLEELGLRLDG VI NI KVDPTCL I 
ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGEP I LEHY 
RKLGLVTD I EGNQE I TEVFAD VEKALL 

SEQ ID NO. 4320: H36B STRAIN 

GDMFRAAMANQTEMGRLAKS Y I DKGELVPDEVTNG I VKERLAEDD I AEKGFLLDGYPRT I 
EQAHALDATLEELGLRLDGVINI KVDPSCL I ERLSGRI INRKTGETFHKVFNPPVDYKEE 
DYYQREDDKPETVKRRLDVNI AQGES I LEHYRKLGLVTD I EGNQE I TEVFAD VEKAL 

SEQ ID NO. 4321: JM9130013 STRAIN 

NLL IMGLPGAGKGTQAAKI VEEFGVAH I STGDMFRAAMANQTEMGRLAKSYI DKGELVPD 
EVTNGI VKERLAEDDI AEKGFLLDGYPRT I EQAHALDATLEELGLRLDGVINI KVDPSCL 
lERLSGRIINRKTGETFHKVFNPPVDYKEEDYTQREDDKPETVKRRLDVNIAQGEPILEH 
YKKLGLVTDI EGN 

SEQ ID NO. 4322: M732 STRAIN 

LL I MGLPGAGKGTQAAKI VEE FGVAH I STGDMFRAAMANQTQMGRLAKS Y I DKGELVPDE 
VTNGI VKERLAEDD I AEKGFLLDGYPRT I EQAHALDATLEELGLRLDGVINI KVDPTCL I 
ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEP I LEHY 
RKLGLVTD I EGNQE I TEVFAD VE KALLELK 

SEQ ID NO. 4323: M781 STRAIN 

NLL I TGLPGAGKGTQAAKI VEEFGVAH I STGDMFRAAMANQTQMGRLAKS Y IDKGELVPD 
EVTNG I VKERLAEDD I AEKGFLLDGYPRT I EQAHALDATLEELGLRLDGVI NI KVDPTCL 
I ERLSGRI INRKTGETFHKVFNP PVD YKEED YYQREDDKPETVKRRLDVNIAQ 



MSA Alignment Results: Pretty output 

PRETTY Of: /biotmp/msa32357 . 2 { *} April 17, 2002 09:17 



msa252352.2{ 

msa252352 
msa252352 .2{ 
msa252352 
msa252352.2{ll4 

msa252352 . 
msa252352.2{ 

msa252352 . 

msa252352 . 

msa252352 . 

msa252352 . 



1 50 

114_18RS2l} -nllttgspg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

2{114_M781} -nllitglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK 

114_CJB110} -nllttgllg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

.2{114_090} -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

_JM9130013} -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

2{ll4__A909j -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

114_1169NTj gkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

2{114_2603) mnllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK 

2{114_C0H1} — llimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK 

2 (114_M732 } — llimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK 

2{114_H36B} GDMFRAAMA NQTeMGRLAK 

Consensus * _********* ***_****** 



msa252352 .2{114_18RS21) 
msa252352.2{H4_M78l) 
msa2523 52 . 2 { 114_CJB110 } 



51 100 
S Y I DKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT I EQAHALDAT 
S Y I DKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT I EQAHALDAT 
SYI DKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT I EQAHALDAT 
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msa252352 . 2 { 114_090 
msa252352 . 2 {114_JM9130013 
msa252352 .2{ll4_A909 
msa252352 . 2 { 114_1169NT 
msa252352.2f 114_2603 
msa252352 . 2 { 114_COHl 
msa252352 . 2 { 114_M732 
msa252352.2{H4_H3 6B 
Consensus 



msa252352 . 2 { 114_18RS21 } 
msa252352.2{ll4_M78l} 
msa252352 .2{114_CJB110} 
msa252352.2{H4_090} 
msa252352 .2 {114_JM9130013} 
msa252352.2{ll4_A909} 
* msa252352 . 2 { 114_1169NT} 
msa252352 .2{114_2603} 
msa252352 .2{ll4_COHl} 
msa252352.2{114_M732} 
msa252352.2{ll4_H36B} 
Consensus 



msa252352 . 2 { 114_18RS2l} 
msa2523S2 .2{li4_M78l} 
msa252352 . 2 { 114_CJB110 } 
msa252352 . 2 { 114_090} 
msa252352 . 2 {114_JM9130013 } 
msa252352 . 2 { 114_A909 } 
msa252352 . 2 { 114JL169NT) 
msa252352 . 2 f 114_2603 } 
msa252352.2{114_COHl} 
msa252352 . 2 { 114_M732 } 
msa252352 . 2 {114_H36B> 
Consensus 



SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
SYIDKGELVP 
********** 

101 

LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
LEELGLRLDG 
********** 

151 

EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
, EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
EDYYQREDDK 
********** 



DeVTNGIVKE 
DeVTNGIVKE 
DeVTNGIVKE 
DqVTNGIVKE 
DeVTNGIVKE 
DeVTNGIVKE 
DeVTNGIVKE 
DeVTNGIVKE 
*_******** 



VINIKVDPSC 
VINIKVDPtC 
VINIKVDPSC 
VINIKVDPSC 
VINIKVDPSC 
VINIKVDPSC 
VINIKVDPSC 
VINIKVDPSC 
VINIKVDPtC 
VINIKVDPtC 
VINIKVDPSC 
********_* 



PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
PETVKRRLDV 
********** 



RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
RLAEDDIAEK 
********** 



LIERLSgRII 
LIERLSgRI I 
LIERLSgRII 
LIERLSgRII 
LIERLSgRII 
LIERLSgRII 
LIERLSgRII 
LIERLSxRII 
LIERLSgRII 
LIERLSgRI I 
LIERLSgRII 
******_*** 



GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
GFLLDGYPRT 
********** 



NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
NRKTGETFHK 
********** 



I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
I EQAHALDAT 
********** 

150 

VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
VFNPPVDYKE 
********** 



200 

nIAQgepile hyrklglvtd iegnqeitev 

nIAQ 

nIAQgepile hy 

nIAQgepile hyrklglvtd iegnqeitev 

nIAQgepile hykklglvtd iegn 

nIAQgesile hyrklglvtd ieg 

hIAQgepile hysklglvtd iegnqei 

nIAQgepile hyrklglvtd iegnqeitev 
nIAQgepile hyrklglvtd iegnqeitev 
nIAQgepile hyrklglvtd iegnqeitev 
nIAQgesile hyrklglvtd iegnqeitev 
_*** 



rasa252352.2{ 

msa252352 . 
msa252352 .2{ 
msa252352 
msa252352.2{H4 

rasa252352 . 
msa2523 52 .2{ 

msa252352 . 

msa252352 . 

msa252352 . 

rasa252352 . 



201 212 

114_18RS2l} fadvekalle — 

2{114_M781} 

114_CJB110} 

.2{114_090} fadvekalle LK 

_JM9130013) — 

2{114_A909} — 

114_1169NT} 

2 ( 114^2603} fadvekalle LK 

2 { 114_COHl } f advekall 

2{ll4jyi732} fadvekalle LK 

2{114_H36B} fadvekal — — 

Consensus ** 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



SEQ ID NO. 4401 
STRAIN 2603 

GTGGATAAACATCACTCAAAAAAGGCTATTTTAAAGTTAACA 
CTTATAAGkAGTAGTATTTTATTAATG 

TTAAAAAACCAAGAGCAAT CAC CTGTAATTG CTAATGTTG CT CAACAGC CAT CG C CAT CG 
GT AACTACT AAT ACTGTTG AAAAARCATCTGTAACAGCTG CTT CTGCTAGTAATACAG CG 
AAAGAAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAGAG 
TT ATCTAAAAACCTTGATACGT CTAATTTGGGGG CTGAT CTTGAAGAAG AATAT C C CT CT 
AAACCAGAG ACAACCAACAATA^GAAAGCAATGTAGTA^ 

G CACAG AAAGTTCC CT CAG CATATG AAGAGGTGAAG C CAGAAAG CAAGTCAT CG CTTG CT 

GTT CTTG ATACATCTAAAATAACAAAATTACAAG CCATAACCCAAAGAGGAAAGGGAAAT 

GTAGTAGCTATTATTGATACTGGCITTGATATTAACCATGATAT^^ 

CCAAAAGATG AT AAG CACAG CTTTAAAACTAAGACAGAATTTGAGGA^TTAAAAG CAAAA 

CATAATATCACTTATGGGAAATGGGTTAACGATA^GATTGT^ 

AACAATACAGAAACG GTGG CTGAT ATTGCAGCAG CTATG AAAGATGGTTATG GTTCAGAA 

GCAOAGAATATTTCGCATGGTACACACGTTGCTGGTATTTTTGTAGGTA^ 

C CAG CAAT CAATGGT CTTCTTTTAGAAGGTGCAG CG CCAAATGCT CAAGTCTTATT AATG 

CGTATTCCAGATAAAATTGATT CGGACAAATTTGGTGAAGCATATGCTAAAG CAAT CACA 

GACG CTGTT AAT CTAGG AG CAAAAACG ATTAATATGAGT ATTGGAAAAACAG CTG ATT CT 

TTAATTGCTCTCAATGATAAAGTTAAATTAGCACTTAAATTAGCT 

G CAGTTGTTGTGG CTGC CGGAAATG AAGG CG CATTTGGTATGGATTATAG CAAAC CATT A 
TCAACTAATCCTGAC!TACGGTACGGTTAATAGrCCAGCT 

GTTG CT AG CTATGAAT CACT TAAAACT ATCAGTGAGGTCGTTGAAACAACTATTG AAGGT 
AAGTTAGTTAAGTTGCCGATTGTGACTTCTAAACCTTTC^ 

GTGGTTTATG C CAATTATGGTG CAAAAAAAGACTTTTG AAGGT AAGGACTTTAAAG GTAAG 
ATTGCATTAATTGAGCX3TGGTGGTGGACTTGATTTTATGACTAAA 
AATGCAGGTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTT 
ATT C CTT AC CGTG AAT t AC CTGTGGGGATT ATTAGTAAAGTAGAT GG CGAG CGTATAAAA 
AATA CTTCAAGT CAGTTAACATTTAACCAGAGTTTTGAAGTAGTTGATAG CCAAGGTGGT 
AATCGTATGCTGGAACAATCAAGTTGGGGCGTGACAGCTGAAGGAGCAATCAAGCCT 
G TAACAG CTTCTGG CTTTGAAATTTATT CTT CAAC CT AT AATAATCAATACCAAACAATG 
T CTG GTACAAGT ATG G CTT CAC CACATGTTGCAGGATTAATGACAATGCTTCAAAGT CAT 
TTGG CTGAG AAATATAAAGGGATG AATTTAGATT CTAAAAAATTG CTAGAATTGT CT AAA 
AACAT C CT CATGAG CT CAG CAACAG CATTATATAGTG AAG AGGATAAGG CGTTTT AT TCA 
CCACGTCAGCA^GGTGCAGGTGTAGTTGATGCTGAAAAAGCTATCCAAGCTCAATATTAT 
ATTACTGGAAACGATGGCAAAGCTAAAATTAATCTCAAACGAATGGGAGATAAATT^ 
ATCACAGTT ACAATT CATAAA.CTTGTAGAAGGTGTCAAAG AATTGTATTATCAAG CTAAT 
GTAG CAACAG AACAAGT AAATAAAG GTAAATTTG C CCTT AAAC CACAAG C CTTG CTAGAT 
ACTAATTGG CAG AAAGTAATTCTT CGTGATAAAGAAACACAAGTTCGATTTACTATTGAT 
GCTAGTCAATTTAGTCAGAAATTAAAAGAACAGATGGCAAATGGTTATT^ 
TTTGTACGTTTTAAAGAAGCCAAG GATAGTAAT CAGGAGTTAATGAGTATT C CTTTTGT A 
GGATTTA^TGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAA 
TCTAAAG GTAGTTT CTACTATAAAC CAAATGATACAACT CAT AAAGAC CAATTG G AGT AC 
AATG AAT CAG CTCCTTT'TGAAAGCAACAA CTATA CTGCCTTGTTAACACAAT CAG CGTCT 
TGGGG CTATGTTGATTATGT CAAAAATGGTGGGGAGTTAG AATT AG CAC CGGAG AGTCCA 
AAAAGAATTATTTTAGGAACTTTTGAGAATAAGGTTGAGGATAA 

GA*VAG AG ATGCAG CG AATAAT C CATATTTTGCCATTT CT C CAAAT AAAGATGG AAAT AGG 
GACGAAATCACTCCCCAGGCAACTTTCTTAAGAAATGTT^ 

CTAGAT CAAAATGG AAATGTTATTTGG CAAAGTAAGGTTTTAC CAT CTT ATCGTAAAAAT 
TT CCATAAT AATCCAAAG CAAAGT GATGGTCATTATCGTATGGATGCTCTTCAGTGGAG T 
GGTTTAGATAAGGATGGCAAAGTTGTAGCAGATGGTTTTTATA 

ACACCAGTAG CAGAAGG AG CAAATAGTCAGGAGTCAGACTTT A^GTACAAGTAAGTACT 
AAGT CAC CAAAT CTTCCTT CACGAG CT CAGTTTG ATGAAACT AAT CG AACATT AA.GCTT A 
G CCATG CCTAAGGAAAGTAGTT ATGTT CCT ACAT ATCGTTT , ACAATT' AGTTT^ CTCAT 
GTTGTAAAAGATGAAGAATATGGGGATGAGACTTCITACCATTATTTCCA 
GAAGGTAAAGTGACACTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGAC 
CCTAAGGCCTTGACACTTGTTGTG<?AAGAT^ 

TCTGATCTCTTG AATAAGG CAGTAGT ATCAGAGAAAGAAAACG CTATAGTAATTT CT AAC 

AGTTTGAAATATTTTGATAACTTGAAAAAAGAACCTATGTTTATTTCT 

GT AGT AAACAAGAAT CTAG AAG AAATAAT ATT AGTTAAG CCG CAAACTACAGTT ACT ACT 

CAATCATTGTCTAAAGAAATAACTAAATCAGGAA 

AATAATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGGGATTCTGTTAAC 
CAT AC CTTAC CT AGT ACAT CAG ATAGAG CAACGAATGGTCTATTTGTTGGTACTTTGGCA 
TTGTTATCTAGTTTACTT'CTTTATT?^ 

SEQ ID NO. 4402 
STRAIN 090 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGTAATTGCT 

AATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATATTGTTGAAAA 

AACATCTGTAACAGCTGCTTCTGCTAGTAATACAGTGAAAGAAATGGGTG 

ATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAGAGTTA 

TCrAAAAACCTTGATACGTCTAATTTGGGGGCrGATCITGAAGAAGAATA 

TCCCTCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTAGTAACAA 

AT G CTTCAACTG CAATAGCACAGAAAGTT C CCTCAG CGT ATG AAG AGGTG 

AAGCf^G^AGCAAGTCATCGCTTGCTGTTTTTGATACATCTAAAATAAC 

AAAATTG CAAG C CAT AACC CAAAG AGG AAAGGG AAATGTAGT AG CTATT A 

TTGATACTGGCTTTGATATTAACCATGATATT1TTCGTTTAGATAGCCCA 

AAAGATG ATAAG CACAG CTTTAAAACTAAAGCAGAATT CG AGG AATT AAA 

AGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATA^GATTGTTT 

TTGCACATAA.CTACGCCAACAATACAGAAACGGTGGCTGATATTGCAGCA 

GCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCATGGTAC 

ACACGTTGCTGGTATTITTGTAGGTAATAGTAAACGTCC^GCAATCAATG 

GT CT T CTTTT AG AAG GTG CAG CG C CAAATGCTCAAGT CTTATTAATG CGT 
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ATTCCAGATAAAATTGATT CGG ACAAATTTGGAG AAG CATATG CT AAAG C 

AATCACAGACGCTG t TAAT CT AGG AG CAAAAa CG ATT AATATGAG CCTTG 

GAAAAA.CAG GAG ATT CTTT AA 1 1 GC aCTCAATGATAAAGTTAAATTAG CA 

CTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTGCCGGAAA 

TG AAGGTG CATTTGGTATGGATTATAG CAAAC CATTATCAACTAAT c CTG 

ACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATACTtTGAGTGTT 

GCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACAACTAT 

TGaaGGTA^GTTAGTTAAGTTGCCGATTGTGACTTCTAAACCTTTtGACA 

AAGGTAAGGC CTACG ATGTGGTTTATG C CAATTATGGTG CAaAAAAAGAC 

TTTGAAGGTAAgGACTTTAAAGGTAAGATTGCATTAATtGAGCGTGGtGG 

TGGACITGATTTTATGACTAAaa t CACTcATGCTACAAATGCAgGTGTTG 

t TGGTaTCGT t ATT 1 1 1 AACgAt CAAGAa aAACG t GGAAATTTTcTAATT 

CCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGGCGAGCG 

TATAAAAAATACTT CAAGT CAGTTAACATTTAAC CAG AGTTTTg AAGTAG 

TTGATAGCCAAGGTGGCAATCGTATGCTGGAACAATCAAGTTGGGGCGTG 

ACAG CTG AAGGAG CAATCAAG CCTG ATGTAACAG CTT CTGGCTTTGAAAT 

TTATT CTTCAAC CTAT AAT AATCAATAC CAAACAATGT CTGGTACAAGT A 

TGGCTTCACCACATGTTGGAGGATTAATGACAATGCTTCAAAGTCATTTG 

GCTGAGAAATATAAAGGGATGAATTTAgATTCTAAAAAATTGCTAGAATT 

GTCTAaAAACATCCTCATGAGCTCAGCaaCAGCATTATATAGTgAAGAgG 

ATAAGGCGTtTtATTCaCCACGTCAGCAAGGtGCAGGtGTAGTTGATGCT 

GAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGCAAAGC 

TAAAATTAAT CT CAAACGAGTGGGAGATAAATTTG ATATCACAGTTACAA 

TTCATAAACTTGTAG AAGGTGT CAAAG AATTGTAT TATCAAG CTAATGT A 

GCAACAGAACaAGTAAATAAAGGTAAATTTGCCCTTAAACCACAAGCCtT 

GCTAGATACTAATTGGCAGAaAGTAATTCTTcGTGATAAAGAAACACAAG 

TTcGATTTACTATTGATGCTAGTCAATTTAGTCAGAAATTAAAAGAACAG 

ATGG CAAATGGTTATTT CTTAg AAGGTTTTGTACGTTTT AAAGAAG CCAA 

GGATAG t AATCAGGAGTTAaTGAGTATTCCTT tTGTAGGATt t AATGGTG 

ATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAAGA.CGCTTTCT 

AAAGGTAGTTTCTACTATAAACCAAATGATACAACTC^TAAAGACCAATT 

GGAGTACAATGAAT CAG CT CCTTTTGAAAG CAACAACTAT ACTG C CTTGT 

TAACACMTCAGCGTCTTGGGGCTATGTTGATTATGTCAAAAATGGTGGG 

GAGTTAGAATTAGCACCGGAgAGTcCAAAAAGAATTATTTTAgGAACTTT 

TGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAAAGAGATGCAG 

Cg AAT AATCCAT ATTTTG C CATTT CTC CAAATAAAGATGG AAATAGG GAT 

GAAATCACTCCCCAGGCAACTTTCTTAAGAAATGTTAAGGATATTTCTGC 

TCAAGTT CTAGATCAAAATGGAAATGT TATTT GG CAAAGTAAGGTTTT AC 

CATCTTATCGTAAAAATTT C CAT AATAATC CAAAG CAAAGTGATGGT CAT 

TATCGTATGGATGCtTTTTCAGTGGAGTGGTTTAGATAAGGATGGCAAAGT 

TGTAGCAGATGGTTTTTATACTTATCGCCTACGTTACACACCAGTAGCAG 

AAGGAG CAAATAGT CAGGAGTCAGACTTTAAAGTTCAAGTAAGTACTAAG 

TCACCAAAT CTT C CTTTACTAGCT CAGTTTGATGAAACT AAT CGAACATT 

AAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTACATATCGTTTAC 

AATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGATGAGACT 

TCTTACCACTATTTCCATATAGATCAAGAAGGTAAA.GTGACACTTCCTA^ 

AACGGTT AAGAT AGG AG AG AGTGAGGTTG CAGTAGAC CCTAAGG C CTTGA 

CACTTGTTGTGG AAGAT AAAG CTG GTAATTTTG CAACGGTAAAATTGTCT 

GACCT CTTG AATAAGGCAGTAGTAT CAGAGAAAGAAAACG CTAT AGT AAT 

TTCTAACAGTTTCAAATATTTTGATAACTTGAAAAAAGAATCTATGTT^ 

TTTCTAAAGAAGGAAAAGTAGTAAACAAGAATCTAGAAGAAATAACATTA 

GTTAAG C CG CAAACT ACAGTTACTACT CAATCATTGT CTAAAGAAATAAC 

TAAATCAGGAAATGAGAAAGTCCTCACCTCTACAAACAATAATAGTAGCA 

GAGTAGCTAAGATCAT AT CACCT AAACATAACGGGGATT CTGTT AAC CAT 

ACC 

SEQ ID HO. 4403 
STRAIN A90 9 

GAGGAGCAAG AATTAAAAAACCAAGAG CAAT 

CACCTGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACT 
AATACTGTTG AAAAAACATCTGTAACATCTGCTT CTG CTAGT AAT ACAG C 
GAAAGAAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAAT 
TATTAGAAG AGTTAT CTAAAAAC CTTG ATA CGTCTAATTTGGGGG CT GAT 
CTTG AAG AAG AATATC CCT CTAAAC GAGAG ACAAC CAACAAT AAAG AAAG 
CAATGTAGTAACAAATGCTTCAACTGCAATAGCACAGAAAGTTCCCTCAG 
CATATGAAGAGGTGAAGCCAGAAAGCAAGTCATCACTTGCTGTTCTTGAT 
ACATCTAAAATAACAAAATTGCAAGCCATAACCCAAAGAGGAAAGGGAAA 
TGTAGTAGCTATTATTGATACTGGCTTTGATATTAACCATGATATTTTTC 
GTTTAGATAGCCCAAAAGATgaTAAGCACAGCTTTAaAACTAAGGCAGAA 
TTTGAGGAATTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAA 
CGATAAGATTGtTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGG 
CTGAT ATTGCAG CAG CT ATG AAAGATGGTT ATGGGTCAGAAG CAAAG AAT 
ATTTCGCATGGTAC7VCACGTTGCTGGTATTTTTGTAGGTAATAGTAAACG 
TCCAGCAATCAATGGTCCTCCTCTAGAAGGTGCAGCGCCAAATGCTCAAG 
T CTTATTAATG CGT ATTCCAG ATAAAATTGATTCGGA CAAATTTG GTGAA 
G CATATG CTAAAG CAATCACAG ACG CTGTT AATCT AG GAG CAAAAACGAT 
TAAT ATG AG C CTTGG AAAAACAG CAGATT CTT TAATTG C T CT CAATG AT A 
AAGTTAAAT TAG CACTTAAATT AG CTT CTG AG AAG GG CGTTGCAGTTGTT 
GTGG CTG C CGGAAATG AAGGTG CATTTGGTATGGATTATAG CAAAC CATT 
ATCAACTAATCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAG 
ATACTTTGAGTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTC 
GTTGAAACAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTC 
TAAACCTTtTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATG 
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GTGCAAAAAAAAGACTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATT 
AATTGAG CGTGGTGGTGGACTTGATTTTATGACT AAAAT CACT CATG CTA 
CAAATG CAGGTGTTGTTGGTAT CGTTATTTTTAACGAT CAAG AAAAACGT 
GGAAATTITCTAATTCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAA 
AGTAGATGG CGAG CGTATAAAAAAT ACTT CAAGT CAGTTAACATTTAACC 
AGAGTTTTGAAGTAGTTGATAGCCAAGGTGGCAATCGTATGCTGGAACAA 
TCAAGTTGGGGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGC 
TTCTGGCrTTGAAATTTATTCTTC^COTATAATAATCAATACCAAACAA 
TGTCTGGTACAAGTATG G CTT CACCACATG t TGCAGG ATTAATGACAATG 
CTTCAAAGT CATTTGG CTG AGaAATATAAAGGGATGAATTTAGATT CTAA 
AAAATTGCT AGaATTGT CTAAAAACAT c CT CATG AG CTCAGCAACAG CAT 
* TATATAGTGAAGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCA 
GGTGT AGTTGAT G CTGAAAAAG CTATC CAAG CTCAAT ATT ATGTTACTGG 
AAACG ATGG CAAAG CTAAAATT AAT CT CAAACGAGTG GG AGATAAATTTG 
ATATCACAGTTACAATT CAT AAACTTGTAGAAGGTGT CAAAG AATTGTAT 
TATCAAGCTAATGTAGCMCAGAACAAGTAAATAAAGGTAAATTTGCCCT 
TaAACCaCAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTcGTG 
ATAAAGAAACACRAGTTCGATTTACTAtTGATTCTAGTCAATTTAGTCAG 
AAATTAAAAGAAC!AG ATGG CAAATGGTTATTT CTTAG AAGGTTTTGTACG 
TTTTAAAGAAG C CAAGG AT AGT AATCAGG AGTTAATG AGT ATT C CTTTTG 
TAGGATTTAATGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATT 
TATAAGACG CTTTCT AAAGGTAGTTTCTACTATAAAC CAAATGATACAAC 
T CAT AAAGACCAATTGG AGT ACAATGAATCAG CT C CTTTTGAAAG CAACA 
ACTATACTG C CTTGTTAACACAATCAG CGT CTTGGGG CT ATGTTG ATTAT 
GTCAAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAAT 
TATTTTAGGAACnTTTGAGAATAAGGTTGAGGATAA 

TGGAAAGAGATG CAG CGAAT AAT CCATATTTTGCCATTT CTCCAAATAAA 
GATGGAAATAGGGATGAAATCACTCCCCAGGCAACTTTCTTAAGAJ\ATGT 
TAAGGATATTTCTG CTCAAGTTCTAGAT CAAAATGGAAATGTTATTTGGC 
AAAGTAAGGTTTTAC CATCTTATCGTAAAAATTTC CATAATAAT C CAAAG 
CAAAGTCATGGTCATTATCGTATGGATGCCCTTCAGTGGAGTGGTTTAGA 
TAAGGATGGCAAAGTTGTAGCAGATGGTTTTTATACTTATCGTTTACG'rT 
ACACACCAGTAG CAG AAGG AG CAAAT AGT CAGGAGTCAG ACTTT AAAGTT 
CAAGTAAGT ACTAAGT CAC CAAAT CTT C CTTCACG AG CT CAGTTTGATG A 
AA CT AAT CGAACATT AAG CTTAGC CATG C CTAAGG AAAGT AGTTATGTTC 
CT ACATATCGT CTACAATTAGTTTT AT CT CATGTTGTAAAAG ATG AAGAA 
TATGGAG ATG AGACTT CTT AC CATT ATTT CCATAT AG ATCGAGAAGGTAA 
AGTGACACTT CCTAAAACAGTT AAGATAGG AGAG AGTGAGGTTG CAGTAG 
ACCCTAAGACCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCA 
ACGGTAAAATTGTCTGAC CT CTTGAAT AAGG CAGTAGTAT CAGAGAA&GA 
AA^CGCTATAGTAATTTCTAACAATTTCAAATATTTTGATAACTTGA 
AAGAAC CTATGTTT ATTT CTAAAG AAG GAAAAGTAGT AAA CAAG AAT CTA 
GAAGAAATAGCATTAGTTA^GCCGCAAACTACAGTTACTACTCAATCATT 
GT CTAAA.GAA^TAACT CAATCAGGAAATG AG AAAGT C CT CACTT CT ACAA 
ACAATAATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGG 
GATTCTGTTAACCATACC 

SSQ XD NO. 4404 
STRAIN H36B 

GAGGAGCAAG AATTAAAAAAC CAAGAG CAAT CACCTGT AATTG C 
TAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATACTGTTGAAA 
AAACATCTGTAAC^TCTGCTTCTGCTAGTAATACAGCGAAAGAAATGGGT 
GATACAT CTGTAAAAAATGACAAAACAGAAGATG AATTATTAGAAGAGTT 
ATCTAAAAAC CTTG AT ACGTCTAATTT GG GGG CTG AT CTTGAAGAAGAAT 
AT CC CTCTAAAC CAG AG ACAAC CAACAATAAAGAAAG CAATGTAGTAACA 
AATG CTT CAACTGCA^T AGCACAG AAaGTT CCCT CAG CAT ATGAAGAGGT 
a^GCCAGAAAGCAAGTCATCACTTGCTGTTCTTGATACA.TCTAAAATAA 
CAAAATTGCAAGCCATAAC C CAAAG AGGAAAGGGAAATGT AG TAG CT ATT 
ATTGATACTGGCITTGATATTAACCATGATATTTTTCGTTTAGAT^ 
APAAG ATGAT AAG CACAG CTTT AAAACT AAGG CAG AATTTG AGGAATTAA 
AAGCAAAACATAATATCACTTATGGGAAATG^GTTAACGATAAGATTGTT 
TTTGCACAT AACTACG C CA aCAATACAGAAACGGTGG CTG ATATTGCAGC 
AGCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCATGGTA 
CACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAATCAAT 
GGTCTT CTTTTAGAAGGTG CAG CG C CAAATGCTCAAGT CTTATT AATGCG 
TATTCa^GATAAAATTGATTCGGACAAATTTGGTGAAGCATATGCTAAAG 
C*VAT CACAGACG CTGTT AAT CT AG GAGCAAAAACG ATTAATATG AG C CTT 
GG AAAAACAG CAGATTCTTT AATTGCT CT CAATG AT AAAGTT AAATT AG C 
ACITAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTGCCGGAA 
ATGAAGGTG CATTTG GT ATGGATT ATAG CAAAC CATT AT CAACT AATCCT 
GACTACGGTACGGTTAATAGTC(ZAGCrATTTCTGAAGATACTTTGAGTGT 
TGCTAGCTATGAATC^CTTAAAACrATC^GTGAGGTCGTTGAAACAACTA 
TTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTCTAAACCTTtTGAC 
AAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAAAAAAGA 
CTTTG AAGGTAAGG ACTTTAAAGGTAAGATTG CATTAATTG AG CGTGGTG 
GTGGACrTGATTTTATGACTAAAATCACTCATGCTACAAATGCAGGTGTT 
GTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTTCTAAT 
TCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGGCGAGC 
GTATAAAAAATACTTCAAGTCAGTTAACATTTAACCAGAGTTTTGAAGTA 
GTTGATAGCCAAGGTGGCAATCGTATG CTGG AACAAT CAAGTTGGGGCGT 
G ACAG CTGAAGG AG CAATCAAG C CT GATGTAACAG CTTCTGG CTTTG AAA 
TTTATTCTT C AACCT AT AATAAT CAAT AC C AAACAAT GT CTGGTACAAGT 
ATGG CTT C^CCACATGTTGCAGGATTAATGACTVATGCTTCAA^GTCATTT 
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GGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCTAGAAT 
TGT CTAAAAACATC CT CAT GAGCT CAG CAACAGCATTATATAGTGAAGAG 
GATAAGG CGTTTTATTCAC CACGTCAG CAAGGTG CAGGTGTAGTTGATG C 
TGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGCAAAG 
CTAAAATTAA.TCTCAAACGAGTGGGAGATAAATTTGATATCACAGTTACA 
ATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTAATGT 
AGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCTTAAACCaCAAGCCT 
TGCTAGATACTAATTGGCAGAAAGTAATTCTTCGTGATAAAGAAACACAA 
GTTCGATTTACrATTGATTCTAGTCAATTTAGTCAGAAATTAAAAGAACA 
GATGGCAAATGGTTATTTCTTAGAAGGTTTTGtACGTTTTAAAGAAGCCA 
AGGATAGTAAT CAGG AGTTAATGAGT ATT C CTTTTGT AGGATTT AATGGT 
GATTTTGCGAACTtACAAGCACTTGAAACACCGATTTATAAGACGCTTTC 
TAAAGGTAGTTT CT ACT AT AAACCAAATG AT ACAACT CATAAAG ACCAAT 
TGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTGCCTTG 
TTAACACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAAAATGGTGG 
GGAGTTAgAATTAgCACCGGAGAGTCCAAAAAGAATTATTTTAGGAACTT 
TTGAGAATAAGGTTG AGGATAAAACAATT CATCTTTTGG AAAGAG ATGCA 
GCGAATAAT CCATATTTTG C GATTTCT CCAAATAAAG ATGGAAAT AG GGA 
TGAAAT CACTCC CCAGG CAACTTT CTT AAGAAATGTTAAGGATATTT CTG 
CTCAAGTTCT AG AT CAAAATGGAAATGTTATTTGG CAAAGTAAGGTTTT A 
CCAT CTTAT CGTAAAAATTT C CAT AAT AAT CCAAAG CAAAGTGATGGTCA 
TTATCGT ATGGATG C CCTT CAGTGG AGTGGTTT AGATAAGGATGG CAAAG 
TTGTAGCAGATGGTTTTTATACITATCGTTTACGTTACACACCAGTAGCA 
GAAGGAGCAAATAGTC^GGAGTCAGACTTTAAAGTTCAAGTAAGTACTAA 
GT CACCAAATCTTC CTT CACGAG CT CAGTTTG ATGAAACTAAT CG AACAT 
T AAGCTT AG C CATG C CTAAGGAAAGTAGTTATGTTC CTACATAT CGT CT A 
CAATTAGTTTTATCT CATGTTGTAAAAGATGAAG AATATGGAGATGAGAC 
TTCTTACCATTATTTCGATATAGATCAAGAAGGTA 

AAACAGTTAAGATAGGAGAGAGTGAGGTTGCAGTAGACCCTAAGACCTTG 
ACACITGTTGTGGA^GATAAAGCTGGTAATTTCGaA^CGGTAAAATTGTC 
TGACCTCTTGAATAAGGCAGTAGTATCAGAGAAAGAAAACGCTATAGTAA 
TTTCTAACAATTTCAAATATTTTGATAACTTC 

ATTTCTAAAGAAGGAAAAGTAGTAAACAAGAATCTAGAAGAAATAGCATT 
AGTT AAG CCG CAAAC TACAGTTACT ACTCAATCATTGT CTAAAG AAATAA 
CTCAATCAGGAAATGAGAAAGTCCTCACTTCTACAAACAATAATAGTAGC 
AGAGTAG CT AAG AT CAT AT CAC CT AAACATAACG G GGATT CTGTTAACCA 
TACC 

SEQ ID NO. 4405 
STRAIN 18RS21 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACC 

TGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATA 

CTGTT GAAAAAACAT CTGTAACAG CTG CTT CTG CT AGTAATACAG CGAAA 

GAAATGGGTGATACATCTGTAAAAAZVTGACAAAACAGAAGATGAATTATT 

AGAAGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTG 

AAGAAGAATATC CCTCT AAAC CAG AGACAACCAACAATAAAGAAAGCAAT 

GTAGTAACAAATG CTTCAACTG CAAT AGCACAGAAAGTT CCCTCAG CATA 

TGAAGAGGTGAAG C CAGAAAG CAAGTCAT CGCTTG CTGTT CTTGATACAT 

CTAAAATAACAAAATTACAAG C CAT AACCCAAAGAGGAAAGGGAAATGTA 

GTAGCTATTATTGATACrGGCTTTGATATTAACCATGATATTTTTCGT^ 

AGAT AGCCCAAAAGATG AT AAG CACAG CTTTAAAACT AAG ACAG AATTTG 

AGGAATTAAA^G CAAAACATAATAT CACTTATGGGAAATGGGTTAACGAT 

AAGATTGTTTTTGCACATAACTACGCCAACAATACAGAAACGGTGGCTGA 

TATTG CAGCAG CTATGAAAGATGGTTATGGTT CAGAAG CAAAG AATATTT 

CGC^TGGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCA 

GCAATCAATGGTCTTCTTTTAGAAGGTGCAGCGCCAAATGCTCAAGTCTT 

ATTAATG CGTATTC CAG AT AAAATTGATT CGG ACAAATTTGGTGAAG CAT 

ATGCTAAAG CAATCACAGACGCTGTTAAT CTAGGAGCAAAAACG ATT AAT 

ATGAGTATTGGAAAAACAGCTGATTCTTTAATTGCTCTCAATGATA^AGT 

TAAATTAGCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGG 

CTGCCGGAAATGAAGGCGCATTTGGTATGGATTATAGCAAACCATTATCA 

ACTAAT C cTG ACTACGGT ACGGTT AATAGTCCAG CTATTT CTG AAGAT AC 

TTTGAGTGTTG CTAG CT ATG AATCACTTAAAACT AT CAGTG AGGTCGTTG 

AAACAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTCTAAA 

CCTTTTGACAAAGGTAAGGC CTACG ATGTG GTTTATG C CAATTATGGTG C 

AAA^AAAG ACTTTG AAGGT AAGGACTTT7VAAG GTAAG AT TG CATT AATTG 

AGCGTGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAAT 

GCAGGTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAA 

TTTTCTAATTCCTTACCGTGAATTACCTGTGGGGATTATTAgTAAAGTAG 

ATGG CGAGCGTATAAAAAATACTT CAAGT C AGTT AAC ATT t AACCAg AGT 

TTTGAAGtAGTTGATAGCCAAGGTGGtAATCGTaTGCTGGAACAATCAAG 

TTGGGGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTG 

G CTTTGAAATTT ATT CTTCAAC CT AT AAT AAT CAATAC CAAa CAATGT CT 

GGTACAAGTATGGCTTC^CCACATGTTGCAGGATTAATGACAATGCTTCA 

AAGTCATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAAT 

TG CT AGAATTGT CT AAAAACAT C CT CATG AG CTCAG CAACAG CATT ATAT 

AGTGAAGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCAGGTGT 

AGTTG ATG CT GAAAAAG CT AT CCAAG CT C aAT ATTAT ATTACTGG AAACG 

ATGG CAaAG CTAAAATT AAT CT CAAACGAATG G GAG ATAAATTTG ATAT C 

ACAGTTACAATTCATaAACTTGTAGAAGGTGTC^AAAGAATTGTATTATCA 

AGCTAATGTAGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCTTaAAC 

CACAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTcGTGATAAA 

GAAACACAAGTTCGATTTACTATTGATGCTAGTCAATTTAGTCAGAAATT 
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AAAAGAACAGATGGCAAATGGTTATTTCITAgAAGQTTTTGTACGTTTTA 
AAGAAGCCAAGGATAGTAATCAGGAGTTAATGAGTATTCCTTTTGTAGGA 
TTTAATGGTGATTTTGCGAACTTACAAGC^CTTGAAACACCX3 
GACGATTTCT AAAGGTAGTTT CT ACTATAAACCAAATGAT ACAACT CAT A 
AAGACCAATTGGAGTACAATGAATCAG CT C CTTTTGAAAG CAACAACTAT 
ACTG CCTTGTTAACACAAT CAG CGT CTTG G GG CTATGTT G ATT ATGT CAA 
AAATGGTGG GGAGTT AGAATTAGC a CCGG AGAGT C CAAAAAGAATT ATTT 
TAGGAACTTTTGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAA 
AGAGATG CAGCGAAT AAT C CAT ATTTTG CCATTTCT C CAAAT AAAGATGG 
AAATAGGGACGAAAT CACTC CCCAG G CAAC t TTCTTAAG AAATGTT AAG G 
ATATTTCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAAAGT 
AAGGTTTTACCATCTTAT CGTAAAAATTT C CATAATAAT CCAAAG CAAAG 
TGATGGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGG 
ATGG CAAAGTTGTAG CAGATGGTTTTT AT ACTTAT CG CTT ACGTTACACA 
C CAGT AG CAGAAGG AG CAAAT AGT CAG GAGTCAG ACTTT AAAGT ACAAGT 
AAGTACT AAGT CAC CAAAT CTT C CTTCACG AG CT CAGTTTGATGAAACTA 
AT CGAACATTA^G CTTAG CCATG CCTAAGG AAAGT AGTTATGTTC CTACA 
TATCGTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGG 
GGATG AGACTT CTTAC CATTATTT C CATATAG AT CAAGAAGGTAAAGTGA 
CACTT CCTAAAACGGTT AAG ATAGG AG AG AGTGAGGTTG CGGTAGACCCT 
AAGGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGcAACGGT 
AAAATTGT CTGAT CT CTTGAAT AAGGCAGTAGTAT CAGAG AAAG AAAACG 
CTATAGTAATTT CTAACAGTTTCAAAT ATTTTGAT AACTTGAAAAAAGAA 
C CTATGTTTATTTCTAAAAAAG AAAAAGT AGT AAACAAG AAT CT AGAAG A 
AATAAT ATT AGTTAAGC CG CAAACTACAGTTACT ACT CAAT CATTGT CT A 
AAGAAATAACTAAAT CAGG AAATGAGAAAGT C CTCACTT CTACAAACAAT 
AATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGGGAITC 
TGTTAAC CAT AC C 

SEQ ID NO. 4406 
STRAIN M732 

GAGGAGCAAGAATTAAAAAACCAAGAG CAATCAC CT 
GTAATTG CTAATGTTG CT CAACAG CCAT CG CCAT CGGT AACTACT AATAT 
TGTTG AAAAAACAT CTGTAACAGCTGCTT CTG CTAGT AAT ACAGTGAAAG 
AAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGA^TTATTA 
GAAGAGTTAT CTAAAAACCTTG ATACGTCTAATTTGGGGG CTGAT CTTG A 
AGAAG AATAT C C CT CTAAACCAGAG ACAAC CAACAAT AAAG AAAG CAATG 
TAGTAACAAATGCTT CAACTG CAATAG CACAGAAAGTTC CCT CAG CAT AT 
GAAGAGGTG AAGT CAGAAAG CAAGTCAT CG CTTGCTGTT CTTGATACAT C 
TAAAATAACAAAATTACAAGCCACAACCCAAAGAGGAAAGGGAAATGTAG 
TAGCT ATTATTGATACTGG CTTTG ATATT AAC CATGATATTTTTCGTTT A 
GAT AG C CCAAAA.GATGATAAGCACAGCTTTAAAACTAAGG CAGAATTTC 
GGAATTAAAAG CAAAACATAATATCACTTATG GG AAATGGGTTAACG AT A 
AG ATTGTTTTTG CACATAACTACGC CAACAAT ACAGAAACGGTGG CTGAT 
ATTG CAG CAG CTATGAAAG ATG GTTATGGGTCAG AAG CAAAGAAT ATTTT 
G CATGGTACACACX3TTG CT GGTATTTTTGTAGGTAATAGTAAACGT CCAG 
CAAT CAATAGTCTT CTTTTAGAAGGTG CAGCG C CAAATG CT CAAGTCTTA 
TTAATGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGAGAAGCATA 
TG CTAAAG CAATCAT AGACG CTGTTAAT CTAGGAG CAAAAACGATTAATA 
TGAG CCTGGGAAAAACGGCTGATT CTTTAATTGCT CTCAATGAT AAAGTT 
AAATTAG CACTTAAATT AG CTT CTGAG AAGGG CGTTG CAGTTGTTGTGG C 
TG CCGGAAATGAAGGTG CATTTGGTATGGATT ATAGa^AACCATT ATCAA 
CTAATCCTGACTACGGT ACGGTTAATAGTC CAG CT ATTT CTGAAGAT ACT 
TTGAGTGTTG CTAG CTATGAAT CACTT AAAACTAT CAGTG AGGT CGTTG A 
AACAACTATTGAAGGTAAGTTAGTTAAGTTGC CG AITGTGAC'IT CTAAAC 
CTT t TGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCA 
AAAAAGATTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATTAATTGAG 
CGTGGTGGTGGACTTGATTTTATGACTAAAAT CACTCATG CTACAAATG C 
AGGTGTTGTTGGTATCX3TTATTTTTAACGATCAAGAAAAACGTGGAAATT 
TT CT AATTC CTTACCGT GAATTACCTGTGGGGGTT ATT AGT AAAGT AGAT 
GG CGAG CGTATAAAAAATACTT CAAGTCAGTTAACATTT AAC CAG AGTTT 
TGAAGTAGTTGATAG CCAAGGTGG CAATCGTATG CTGGAACAAT CAAGTT 
GGGGCGTGA CAG CTGAAGGAG CAAT CAAGC CTG ATGT AA CAG CTT CTGGC 
TTTGAAATTTATTCTTCAAC CT AT AAT AAT CAAT ACT AAA CAATGTCTGG 
TACAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAA 
GTCATTTGGCTG AG AAAT AT AAAG GGATG AATTTAGATT CTAAAAAATTG 
CT AG AATTGT CTAAAAACAT C CTCATG AG CTCAG CAACAG CATT ATAT AG 
TGAAGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCAGGTGTAG, 
TTGATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGAT 
GG CAAAGTT AAAATT AAT CT CAAACGAGAGGG AG ATAAAT TTG AT ATCAC 
AGTTACMTTCATaAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAG 
CTAATGT AG CAA CAGAaCAAGT AAAT AAAG GTAAATTTG C C CTT aAACCA 
CAAG C CTTG CT AGAT ACT AATTGG CAGAAAGT AATTCTT CGTG AT AAAG A 
AACA CAAGTT CG ATTTACTATTGAT GCTAGT CAATTT AGT CAGAAATTAA 
AAaAAa^GATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTTTAAA 
GAAGCCAAGGATAGTAATCAGGAGTTAATGAGTATTCCTTTTGTAGGATT 
TAATGGTGATTTTGCGAACTTACAAGCACTTGAAACaCCGATTTATAAGA 
CGCTTTCT AAAG GTAGTTT CTACTATAAAC CAAATG ATACAACT CAT AAA 
GACCAATTGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATAC 
TGCCTTGTTAACACAATCA.GCGTCTTGGGGCTATGTTGATTATGTCAAAA 
ATGGTGG GG AGTTAG AATTAG CAC CGG AG AGT C CAAAAAG AATTATT TTA 
GGAACTTTTGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAAAG 
AG ATG CAG CG AATAATC CAT ATTTTGC CATTT C CAAAT AAAG ATGGAA 
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ATAGGGACGAAATCACTCCCC^GGC^CTTTCTTAAGAAATGTTAAGGAT 
ATTT CTG CTCAAGTT CTAGAT CAAAATGG AAATGTTATTTGG CAAAGT AA 
GGTTTTAC CATCTT ATCGT AAAAATTT C CATAAT AAT CCAAAG CAAAGTG 
ATGGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGGAT 
GGCAAAGTTGTAGCAGATGGTTTTTATACTTATCGCTTACGTTACACACC 
AGTAG CAGAAGG AG C aAATAGTCAGGAGT CAG ACTTTAAAGTT CAAGTAA 
GTACTAAGTCACCAAATCTTCCTTCACGAG CTCAGTTTGATGAAACTAAT 
CGAACATTAAG CTT AGC CATG C CTAAGGAAAGTAGTTATGTT CCT ACAT A 
TCGTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGG 
ATG AGACTT CTT AC CATTATTTCCATATAGAT CAAGAAGGTAAAGTG ACA 
CTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGACCCTAA 
GGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTTGCAACGGTAA 
AATTGTCTG AC CTCTTG AAT AAGG CAGTAGTAT CAGAGAAAG aAAACGCT 
AT AGT AATTT CTAACAGTTT CAAATAT TTTGATAACTTG AAGAAAGAACC 
TATGTTTATTTCTAAAG AAGGAAAAGT AGT AAACAAGAAT CT AG AAG AAA 
TAACATTAGTTAAG C CT CAAACTACAGTTACTACTCAAT CATTGT CTAAA 
GAAATAACTAAATCAGGAAATGAGAAAGTCCTCACTTCTACAAACAATAA 
TAGT AGCAGAGTAG CTAAG ATCATAT CAC CTAAACATAACGGGGATTCTG 
TTAACCATACC 

SEQ ID NO. 4407 
STRAIN COH1 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGT 
AATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTaACTACTAATATTG 
TTGAAAAAACAT CTGTAACAG CTG CTT CTG CTAGTAATACAGTG AAAGAA 
ATGGG t gAT ACAT CTGTAAAAAATG ACAAAACAGAAGATGAATTATT AGA 
AGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAG 
AAGAATATCCCTCTAAACCAGAGaCAACCAACAATAAAGAAAGCAATGTA 
GT AACAAATG CTT CAACTGCAATAG CACAG AAAGTTCCCT CAGCAT ATG A 
AGAGGTGAAGTCAGAA&GCAAGTCATCGCTTGCTGTTCT 
AAATAACAAAATTACAAGCCACAACCCAAAGAGGAAAGGGAAATGTAGTA 
G CTATTATTGATACTGG CTTTG ATATTAAC CATG ATATT^ 
TAG CC CAAAAGATG ATAAG CACAG CTTTAAAACT AAGGCAGAATTTG AGG 
AAtTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAG 
ATTGTTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGGCTGATAT 
TG CAG CAG CT ATGAAAG ATGGTTATGGGT CAGAAG CAAAG AATATTTTG C 
ATGGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCA 
ATCAATAGTCTT CTTTT AG AAG GTG CAG CG CCAAATG CT CAAGT CTTATT 
AATGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGAGAAGCATATG 
CTAAAGCAATCATAGACGCTGTTAATCTAGGAGCAAAAACGATTAATATG 
AGCCTGGGAAAAACGGCTGATTCTTTAATTGCTCTCAATGATAAAGTTAA 
ATTAG CACTTAAATTAG CTT CT GAG AAGGG CGTTG CAGTTGTTGTGG CTG 
CCGGAAATGAAGGTG CATTTGGTATGG ATTATAG CAAAC CATTAT CAACT 
AAT CCTG ACTACGGT ACGGTTAAT AGT CCAGCTATTT CTG AAGATACTT T 
GAGTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAA 
CAACTATTGAAGGT AAGTTAGTTAAGTTG C CGATTGTGACTT CTAAAC CT 
TtTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAA 
AAAGATTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATTAATTGAGCG 
TGGTGGTGG ACITG ATTTT ATG ACTAAAATCACTCATG CTACAAATG CAG 
GTGTTGTTGGTATCGTT ATTTTTAACGAT CAAGAAJ\AACGTGGAAATTTT 
CTAATTCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGG 
CGAGCGTATAAAAAATACTTCAAGTCAGTTAACATTTAACCAGAGTTTTG 
AAGTAGTTGATAGCCAA.GGTGGCAATCGTATGCTGGAACAATCAAGTTGG 
GGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTT 
TGAaATTTATTCTTCAACCTATAATAATCAATACn'AAACAATGTCTGGTA 
CAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAAGT 
CATTTGG CTGAG AAATATAAAGGG ATG AATTTAG ATTCTAaAAAATTG CT 
AG aATTGTCTAa aAACAT C CT CATGAG CT CAG CA^CAGCATTATATAGTG 
AAGAGGATAAGG CGTTTTATTCAC CACGT CAGCAAGGTGCAG GTGTAGTT 
GATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGG 
CAAAGTTAAAATTAAT CT CAAACGAG AGGG AGATAAATTTG ATAT CACAG 
TTACAATTCATaAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCT 
AATGTAGCAaCAGAACAAGTAAATAAAGGTAAATTTGCCCrTAAACCACA 
AG CCTTG CTAGATACT AATTGG CAG AAAGTAATTCTT cGTGATAAAGAAA 
CACAAGTT CGATTTACTATTGATG CTAGT CAATTT AGTCAG AAATTAAAA 
GAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTTTAAAGA 
AG C CAAGGATAGTAATCAG G AGTTAATGAGTATT C CTTTTGTAGGATTT A 
ATGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAAGACG 
CTTT CTAAAGGTAGT TT CTACTAT AAACCAA^TGATACAACTCATAAAG A 
CCAATTGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTG 
C CTTGTT AACACAAT CAG CGT CTTGGGG CT ATGTTG ATT ATGTCAAAAAT 
GGTGGGGAGTTAGA^TTAGCACCGGAGAGTCCAAA^AGAATTATTTTAGG 
aACTTTTGAGAATAAGGTTGAGGATAAAACAATT CAT CTTTTGG AAAGAG 
ATG CAG CGAATAAT C CAT ATTTTG C CATTT CT C CAAATAAAG ATGGAAAT 
AGGG ACG AAATCACT C CC CAGG CaACTTT CTTAAG AAATGTT AAG GATAT 
TTCTGCTCAAGtTCTAGATCAAAATG^AAATGTTATTTGGCAAAGTAAGG 
. TTTT ACCAT CTT AT CGT AAAAATTT CCAT AAT aAT CCAAAGCAAAGT GAT 
GGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAgATAAGGATGG 
CAAAGTTGTAgCAGATGGtTTTTATACOTATCGCTTACGTTACACACCAG 
T AGCAG AAGGAG CAAATAGT CAGG AGTCAG ACTTT aAAGTTCAAGTAAGT 
AcTAAGTC^CCAAATCTTCCTTCACGAGCTCAGTTTGATGaAACTAATCG 
AACATT AAG CTT AG C CATG C CTAAG GAAAGTAGTTATGTT CCTACATAT C 
GTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGAT 
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GAGACTT CTT ACCATTATTT CCATATAGAT CAAGAAGGTAAAGTG ACACT 
TCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGACCCTAAGG 
C CTTGACACTTGTTGTGGAAG ATAAAG CTGGTAATTTTG CAACGGTAAAA 
TTGTCTG AC CTCTTGAATAAGG CAGTAGTATCAGAGAAAGAAAACG CTAT 
AGTAATT TCT AACAGTTTCAAATATTTTG ATAACTTC 
TGTTTATTTCTAA^GAAGGAAAAGTAGTAAACAAGAATCTAGAAGAAATA 
ACA k TTAGTTAAGCCTC!AAACTACAGTTACTACTCAATCATTGTCTAAAGA 
AATAACTAAAT CAGG AAATG AGAAAGTCCTCACT T CTACAAACAATAATA 
GTAG CAG AGT AG CT AAGAT CATAT CAC CTAAACAT AACG GGG ATTCTGTT 
AACCATACC 

SEQ ID NO. 4408 
STRAIN M781 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGT 
AATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATATTG 
TTGAAAAAACAT CTGTAACAGCTG CTTCTG CTAGTAATACAGTGAAAGAA 
ATGGGTGATACATC!TGTAAAAAATGACAAAACAGAAGATGAATTATTAGA 
AGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAG 
AAGAATATCCCrCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTA 
GTAACAAATG CTTCAACTG CAATAG CACAG AAAGTTC C CT CAG CATATG A 
AGAGGTGAAGTCAG AAAG CAAGTCATCGCTTG CTGTT CTTGATACAT CTA 
AAAT AACAAAATTACAAG CCACAAC CCAAAGAGGAAAGGG AAATGTAGTA 
GCTATTATTGATACTGGCTTTGATATTAACCATGATATTTTTCGTTTAGA 
TAG CC CAAAAGATG ATAAG CACAG CTTTAAAACTAAGG CAGAATTTG AG G 
AATTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAG 
ATTGTTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGGCTGATAT 
TG CAGCAGCTATGAAAG ATGGTTATGGGTCAGAAG CAAAG AATATTTTG C 
ATGGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCA 
AT CAAT AGTCTT CTTTTAG AAGGTG CAG CGCCAAATGCTCAAGTCTTATT 
AATGCGTATTCCAGATAAAATTGATT CGG ACAAATTTGGAGAAG CATATG 
CTAAAG CAAT CATAGACG CTGTTAATCTAGGAGCAAAAACGATTAATATG 
AG CCTGGGAAAAACGGCTG ATTCTTTAATTG CTCT CAATGATAAAGTTAA 
ATTAGCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTG 
C CGG AAATGAAG GTG CATTTGGTAT GGATTATAG CAA a C CATTATCAa CT 
AATCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATACTTT 
GAGTGTTGCTAGCTATGAATCACTtAAAACTATCAGTGAGGTCGTTGAAA 
CAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACtTCTAaACCT 
TTTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAA 
AAAGATTTTGAAGGTAAGG A CTTTAAAGGTAAGATTG CATTAATTGAG CG 
TGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAATGCAG 
GTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTT 
cTAATTCCTTACCGTGAATTACCTGTGgGGGTTATTAGTAAAGTAGATGG 
CG AG CGTAT AAAAAATACTT CAAGT CAGTTAACATTTAAC CAGAGTTTTg 
AAGTAGTTG ATAG CCAAGGTGG CAATCGTATGCTGGAACAATCAAGTTGG 
GGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTT 
■ TGAAATTTATTCTTCAACCTATAATAATCAATACTAAACAATGTCTGGTA 
CAAGTAT GG CTTCAC CACATGTTG CAG GATTAATGAC AATG CTT CAAAGT 
CATTTGG CTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCT 
AGAATTGTCTAAAAACATCCTCATGAGCTCAGCAACAGC^TTATATAGTG 
AAGAGGATAAGGCX3TTTTATTCACCACGTCAGCAAGGTC 
GATG CTG AAAAAG CT ATCCAAG CT CAATATTATGTTACTGGAAACGATGG 
CAAAGTTAAAATTAATCTCAAACGAGAGGGAGATAAATTTGATATCACAG 
TTACAATTCATaaACTTGTAgAAGGTGTCAAAGAATTGTATTATCAAGCT 
AATGTAGCaaCAGAACAAGTAAATAaAGGTAAATTTGCCCTTaAaCCaCA 
AG C CTTG CTAGATACTAATTGG CAG AaAGT aATT CTT cGTGATAAAGAAA 
CACAAGT Tc G ATTTACTA t TGATG CTAGTCAATTT AGTCAGAAATTAAAA 
GAACAa^TGGCAAATGGTTATTTCrTAGAAGGTTTTG 
AG CCAAGGATAGTAATCAGGAGTTAATGAGTATTC CTTTTGTAGGATTT A 
ATGGTGATTTTGCGAACTtACAAGCACTTGAAACACCGATTTATAAGACG 
CTTT CTAAAGGTAGTTT CTACTATAAa C CAAATGATACAACT CAT AAAGA 
CCAATTGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTG 
CCTTGTTAACACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAAAAT 
G-GTGGGG AGTTAGAATT AG CAC CG GAGAGTCCAAAAAG AATT ATTTTAGG 
AACTTTTGAGAATAAGGTTGAGG^TAAAACAATTCATCTTTTGGAAAGAG 
ATG CAG CGAATAATC CATATTTTG C CATTT CT CCAAATAAAG ATG GAAAT 
AGGG ACG a aATCACT CC C CAGG CaAC t TT CTT AAGAAATGTTAAG GATAT 
TT CTG CT CAAG t TCTAG ATCAAAATGG AAATGTT ATTTGG CAAAGTAAG G 
TTTTAC CAT CTTAT CGTAAAAATTT CCATAAT aAT CCAAAG CAAAGTGAT 
GGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGGATGG 
CAAAGTTGTAGCAGATGGTTTTTATACTT^ 

TAGCAGAAGGAGCAAATAGTC^GGAGTCAGACrrTAAAGTTaVAGTAAGT 
ACTAAGT CAC CAAAT CTTC CTT CAC GAG CTCAGTTTG ATGAAACTAATCG 
AACATTAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTACAtATC 
GTTT ACAATT AGTTTTAT CT CATGTTGTAAAAGATGAAGAAT ATGGGGAT 
GAGACTT CTT AC CATTATTT CCATATAGAT CAAGAAGGTAAAGTG ACACT 
T C CT AAAACGGTTAAGATAG G AGAG AGTG AGGTTG CGGTAG ACC CTAAGG 
CCTTGACACTTGTTGTGGAAGATAAAG CTGGTAATTTTG CAACGGTAAAA 
TTGTCTG AC CTCTT G AATAAGG CAGTAGTATCAGAGAAAGAAAACG CTAT 
AGTAATTTCTAA CAGTTTCAAATAT TTTG ATAACT TG AAG AAAG AAC CTA 
TGTTTATTTCTAAAGAAGGAAAA.GTAGTAAACAAGAATCTAGAAGAAATA 
ACATTAGTTAAG CCT CAAACT ACAGTTACTACTCAAT CATTGTCTAAAG A 
AATAACTAAATCAGG AAATG AG AAAGTC CT CACT T CTACAAACAATAATA 
GTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGGGATTCTGTT 
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AACCATACC 

SEQ ID NO. 4409 
STRAIN CJB110 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGTAA 
TTGCTAATGTTGCTCAACAGCCATCGCCATCGGTA^CTACTAATATTGTT 
GAAAAAACATCTGTAnCAGCTGCTTCTGCTAGTAATACAGCGAAAGAAAT 
GGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAG 
AGTTATCTAAAAACCTTGATACGTCTAATv/IGGGGGCTGATCTTGAAGAA. 
GAATAT C CCTCTAAACCAGAGACAAC CAACAATAAAGAAAG CAATGTAGT 
AACAAATGCTTCAACTG CAATAG CACAGAAAGTT C CCTCAGCGTATG AAG 
AGGTGaAGCCAGAAAGCAAGTCATCGCTTGCTGTTTTTGATACATCTAAA 
ATAACAAAATTG CAAG C CATAACC CAAAGAGGAAAGGGAAATGTAGTAG C 
TATT ATTGATACTGG CTTTG ATATTAACCATGATATTTTT CGTTTAGAT A 
GCCCAAAAGATGATAAGCACAGCTTTAAAACTAAAGCAGAATTCGAGGAA 
tTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAGAT 
TGTTTTTGCACAlTAACTACG CCAACAATACAGAAACGGTG G CTGATATTG 
CAGCAGCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCAT 
GGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAAT 
CAATGGT CTTCTTTTAG AAGGTG CAGCG C CAAATG CT CAAGT CTT ATTAA 
TG CGTATT CCAGAT AAAATTGATTCGGAGAAATTTGG AG AAG CATATG CT 
AAAGCAATCACAGACG CTGTTAAT CTAGG AGCAAAAACGATTAATATGAG 
CCTTGG AAAAACAG CAG ATT CTTTAATTG CACTCAATGATAAAGTTAAAT 

GG AAATGAAGGTGCATTTGGTATGG ATTAT Ag CAAAC CATTATCAACTAA 
Tc CTGACTACGG t ACGGTTAATAGT CCAG CTATTT cTGAAGATACTTTG A 
GTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGaAACA 
ACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTcTAAACCTTT 
TGACAAAGGTAAGG CCTACGATGTGGTTTATGCCAATTATGGTG CAAAAA 
AAGACTTTGAAGGTAAGGALTlU'lAAAGGTAAGATTGCATTAATTGAGCGT 
GGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAATGCAGG 
TGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAA^CGTGGAAATTTTc 
TAATTCCTTACCGTGAATTACCTGTGgGGGTTATTAGTAAAGTAGATGGC 
GAGCGTATAAAAAATACTTCAAGTCAGTTAACATTTAACCAgAGTTTTGA 
AGTAgTTGATAGCCAAgGTGGCAATCGTATGCTGGAACAATCAAGTtGGG 
GCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTTT 
GAAATTTATT CTTCAAC CTATAATAAT CAATACCAAACAATGTCTGGTAC 
AAGTATGGCTTCACCACATGtTGCAGGATTAATGACAATGCTTCAAAATC 
ATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCTA 
GAATTGTCTAAAAACATCCTCATGAGCTCAGCAACAGCATTATATAGTGA 
AG AGGAT AAGGCGTTTTATTCAC CACGTCAGCAAGG t G CAGGTGT AGTTG 
ATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGC 
AAAG CTAAAATTAAT CT CAAACGAGTG GGAGATAAATTTG AT AT CACAGT 
TACAATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTA 
ATGTAGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCrTaAACCACAA 
GCCTTGCTAGATACTAAT/TGGCAGAAAGTAATTCTTcGTGATAAAGAAAC 
ACAAGTTCGATTTACTAtTGATGCTAGTCAATTTAgTCAGAAATTAAAAG 
AACAGATGGCAAATGGTTATTTCTTAgAAGGTTrTGTACGTTTTAAAGAA 
GCCAAGGATAGTAATCA.GGAGTTAATGAGTATTCCTTTTGTAGGATTTAA 
TGGTGATTTTGCGAACTtACAAGCACTTGAAACACCGATTTATAAGACGC 
TTTCTAAAGGTAGT t T CTACTATAAAC CAAATGATACAAC TCATAAAGAC 
CAATTGGAGTACAATGAATCAGCTCc t TTTGAAAG CAACAACTATACTGC 

GTGGGGAGTTAGAATTAG CACCGGAGAGTCCAAAAAGAATTATTTTAGGA 
ACTTTTGAGAATAAGGTTGAGGATAAAACAATTC^^ 

TG CAGCG AATAATCCATATTTTGCCATTTCTCCAAATAAAGATGGAAATA 
GGGATGa aAT CACTCC CCAGGCAAC t TT CTTAAGAAATGTTAAGGATATT 
TCTG CT CAAG TTCTAGATCAAAATGGAAATGTTATTTGG CAAAGTAAGGT 
TTTACCATCTTATCGTAAAAATTTCCATAATAATCCAAAGCAAAGTGATG 
GTCATT ATCGT ATGG ATG C CTTT CAGTGG AGTGGTTTAg ATAAgGATGG C 
AAAGTTGTAG CAGATGGTTTTTATACTTAT CG C CTACGTT ACACACCAGT 
AG CAG AAgG AGCAAATAGT CAGGAGT CAg ACTTTAAAGTTCAAGTAAGTA 
CTAAGT CAC CAAAT CTT CCTTTACTAG CT CAGTTTGATG AAACTAAT CG A 
ACATTAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTACATATCG 
TTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGATG 
AGACTTCTTACCATTATTTCCATATAGATCAAGAAGGTAAAGTGACACTT 
CCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCAGTAGACCCTAAGGC 
CTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTTGCAACGGTaAAAT 
TGTCTGACCTCTTGAaTAAgGCAGTAGTATCAGAGAAAGAAAACGCTATA 
GTAATTTCTAACAGTTT CAAAT ATTTTGATAACTTGAAAAAAGAATCTAT 
GTTTATTT CT AAAGAAGGAAAAGT AGTAAACAAG AAT CT AGAAG AAATAA 
CATTAGTTAAGCCGCAaACTACAGTTACTACTCAATCATTGTCTAAAGAA 
ATAACTAAAT CAGG AAATG AGAAAGTCCT CACTT CTACAAACAATAATAG 
TAG CAGAGT AG CT AAGAT CATATCACCTAAACATAACGGG GATT CTGTTA 
ACCATACC 

SEQ ID NO. 4410 
STRAIN 116 9NT 

GAGG AG CAAG AATTAAAAAAC CAAG AG CAATC 

ACCTGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTA 
ATATTGTTGAAAAAACATCTGTAACAGCTGCTTCTGCTAGTAATACAGCG 
AAAG AAATG G GTGATACAT CTGTAAAAAATG ACAAAACAG AAGATG AATT 
ATTAGAAGAGTTATCTAAAAACCI^GATACGTCTAATATGGGGGCTGATC 
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TTGAAG AAGAAT AT C C CT CT AAAC CAGAG ACAAC CAACAATAAGG AAAG C 
AATGTAGTAACAAATGCTT CAACTG GAAT AG CACAGAAAGTT C CCTCAGC 
ATATG AAGAGGTGAAG CCAAAAAG CAAGT CAT CG CTTGCTGTTCTTG AT A 
CATCT AAAATAACAAAATTG CAAG CCATAAC C CAAAG AGGAAAGG GAAAT 
GTAGTAGCTATTATTGATACTGGCOTTGATATTAACCATGATATTTT^ 
TTTAGAT AG CCGAAA^G ATG ATAAG CACAG CTTTAAAAAT AAGG CAG AAT 
T CGAGGAATTAAAAG CAAAAGATAATATCACTTATGG GAAAT GG GTT AAC 
GATAAGATTGTTTTTGCACATAACTACGCCAACAATACAGAAACGGTGGC 
TGATATTG CAG CAG CTATGAAAgATGGTTATGGTT G^GAAG CAAAGAATA 
TTTCG CATGGT ACACACGTTG CTGGTATT t TTGTAGGTAATAGT AAACGT 
CCAGCAATCAATGGTCTTCTTTTAgAAGGTGCAgCGCCyVAATGCTCAAGT 
CITATTAATGCGTATTCCAGATAAAATtGATTCXSGAC^AATTtGGAGAAG 
CATATGCTAAAGCAATCACAGACGCTGCTAATCTAGGAGCTaAAACGATT 
AATATGAGTATTGG AAAAACAG CTG ATTCTTTAATTGCT CTCAATGATAA 
AGTTAAATTAgCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTG 
TGGCTGcCGGAAATGAAGGCGGATTtGGTATGGACTATAGCAAACCGTTA 
TCAACTAAT c CTGACTACG G t ACGG tT AATAGT C CAG CT ATTTCTGAAGA 
TACTTTG AGTGTTG CTAG CTATGAATCACTTAAAACT AT CAGTG AGGT CG 
TTGAAACAACTATT G AAGGTAAGTTAGTTAAGT t G C CGATTG t G ACTT CT 
AAAC CTT 1 1 GACAAAGGTAAGG CCTACGATGTGGTTTATG CCAATTATGG 
TG CAAAAAAAGACTTTG AAGGTAAGGACTTTAAAG GTAAGATTG CAT TAA 
TTGAGCX3TGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACA 
AATGCAGGTGTTGTTG<3TATCGTTATTTTTAACGATCAAGAAAAACGTGG 
AAATTTTCTAATTCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAG 
TAGATGG CGAG CGTAT AAAAaATACTTCAAGT CAGTTAACATTT AAC CAg 
AGATTTGAAGTAGTTGATAGCC^gGTGGCMTCGTATGCrGGAACAATC 
aAGT t GGGG CGTGACAG CTG AAGG AGCAATCAAG CCTGATGTAACAG GIT 
CTGGCTTCXSaAATTTATTCTTCaaCCT 

TCTGGTACAAGTATGGCITCACCACATGTTGCAGGATTAATGACAATGCT 
T CAAAGT CATTTGG CTG AG aAATATAAAGGGATGAATTTAgATT CTAaAk 
AATTG CTAG AATTGT CT AAAAACATCCT CATGAG CTCAG CAACAG CATTA 
TATAGTGAAGAGGAT AAGG CGTTTTATT CACCACGTCAG CAAGG t GCAGG 
TGTAGCTGATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAA 
ACGATGG CAA^G CTAAAATT AATCT CAAACGAGT GGG AG ATAAATTT GAT 
AT CACAGTTACAATT CATAAACTTGTAGAAGGTGTCAAAGAATTGTATTA 
TCAAG CT AATGT AG CAACAG AA CAAGT AAATAAAGGTAAATTTG C CCTTA 
AACCACAAGCCTTGCTAGATACTAACTGGCAGAAAGTAATTCTTcGTGAT 
AAAGAAACACAAGCTCGATTTACTACTGATGCTAGTCAATTTAgTCAGAA 
ATTAAAAGAACAGATGGCAAATGGTTACTTCITAgAA.GGTTTTGTACGTT 
TTAAAGAAG CTAAGGAT AGT AAT CAGGAGTTAATGAGTATT C CTTTTGTA 
GGATTTAATGGTGATTCTGCGAGCTTACAAGCACTTC 
TAAGACG CTTT CTAAAGGT AGTCT CTACT ATAAAC CAAATGATACAACTC 
ATAAAGAC CAATTG G AGTATAATG AAT CAG CT CCTTTTGAAAG CAACAAC 
TATACTG C CTTGTTAACACAAT CAGCGT CTTGGG G CT ATGTTGATTATGT 
CaAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTcCAAAAAGAATTA 
TTTTAGGAACTTTTGAGAATAAGGTTGAGGATAAAACAATTCA 
GAAAGAG ATG CAG CG AATAATC CAT ATTTTGC CATTT CT CCAAATAAAG A 
TGGAAATAG G GATGAAATCACT CC C CAGG CAACTTT CTTAAGAAATGTT A 
AGGATATTTCTG CTCAAGTT CT AG ATCAAAATGG AA ATGTTATTTGG CAA 
AGTAAGGCTCTACCATCTT ATCGTAAAAATTTCCATAATAAT C CAAAG CA 
GAGTGATGGTCATTATCGTATGGATGCCCTTCAGTGGAGTGGTTTAgATA 
AGGATGGCAAAGCTGTAGCAGATGGTTCTTATACTTATCGCTTACGTTAC 
ACAC CAG TAG CAGAAGG AG CAAATAGTCAGGAGT CAGACTTT AAAGTTCA 
AGTAAGTACTAAGTCLACCAAAT CTT CCCT CACGAGCTCAGTTTG ATG aAA 
CTAATCGAACATTAAGCTTAGCCATGCCTAAGGGAAGTAGTTATGTTCCT 
ATATATCGTCTACAATTAGTTTTAT CT CATGTTGTAAAAGATGAAGAATA 
TGGAG ATGAG ACCT CTTACT ATTATTT CCAT ATAGAT CAAGAAGGTAAAG 
CGACACTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCAGTAGAC 
CCTAAGGCCCTGACACTTGCTGTGGAAGATAAAGCTGGTAATTTCGCAaC 
GGTAAAATTGT CTGACCTCTTG AATAAGG CAGTAGTAT CAGAGAAAG AAA 
, ACGCTAT AGTAATTTCTAACAGTTT CAAATATTTTGATAACTTGAAAAAA 
GAACCTATGTCTATTTCTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGA 
AGAaATAATACTAGTTAAGCCGCAcACTAC^GTTACTACTCAaTCATTGT 
CTAAAGAAATAACTAAATCAGGAAATGAGAAAGTCCTCACTTCTACAAAC 
AATAATAGTAGTAGAGTAGCTAAAATCATATCACCTAAACATAATGGGGA 
'TTCTGTTAACCATACC 

SEQ XD NO. 4411 
STRAIN OM9130013 

GAGG AG CAAGAATT AAAAAACC AAGAG CAATCACCTGTAA 
TTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATACTGTT 
GAAAAAACAT CTGT AACAG CTG CTTCTG CTAGTAAT ACAGCGAAAG AAAT 
GGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAG 
AGTTATCTAAAAACCrrTGATACGTCTAATTTGGGGGCTGATCTTGAAGAA 
GAATATCCCTCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTAGT 
AACAAATGCTTCAACTG CAAT AG CACAGAAAGTT CC CT CAG CAT ATG AAG 
AGGTGAAGCCAGAAAGCAAGTCATCGCTTGCTGTTCTTGATACATCTAA^ 
AT AACAAAATTACAAG C CATAAC CCAAAG AGG AAAGGGAAATGT AGT AG C 
TATTATTGATACTGGCTTTGATATTAACCATGATATTTTTCGTTTAGATA 
G C CCAAAAG ATG AT AAG CACAG CT TTAAAACT AAGACAG AATTTG AGGAA 
TTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAGAT 
TGTTTTT G CACATAACT ACG C CAACAAT ACAG AAACGGTGG CTG ATATTG 
CAG CAGCT ATG AAAG AT G GTTATGGTT CAG AAG CAAAGAATATTT CG CAT 



771 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 

GGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAAT 

CAATGGTCTTCTTTTAGAAGGTGCAGCGCCAAATGCTCAAGTCTTATTAA 

TG CGTATT C CAG ATAAAATTGATTCGG ACAAATTTGGTGAAG CAT ATGCT 

AAAG CAAT CACAGACGCTGTTAATCTAGG AG CAAAAACG ATTAATATGAG 

TATTGGA^AAACAG CTG ATT CTTTAATTG CT CT CAATGATAAAGTTAAAT 

TAGCACTTAAATTAGCTTCrGAGAAGGGCGTTGCAGTTGTTGTGGCTGCC 

GG AAATG AAGG CGC ATTTG GTATGG ATTAT AG CAAACCATTATCAACTAA 

TCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATACTTTGA 

GTGTTGCTAGCTATGAATCACTTAAAACrATCAGTGAGGTCGTTGAAACA 

ACTATTGA^GGTAAGTTAGTTAAGTTGCCGATTGTGACTTCTAAACCTTT 

TGACAAAgGTAAgGCCTACGATGTGGTTTATGCCAATTATGGTGCAAAAA 

AAGACITTGAAGGT AAGGACTTTAAAGGT AAG ATTG CATT AATTG AG CGT 

GGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAATGCAGG 

TGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTTC 

TAATT CCTTACCGTGAATTAC CTGTGGGG ATTATT AGTAA&GTAGATGG C 

GAG CGTATAAAAAATACTTCAAGTCAGTTAACATTTAAC CAGAGTTTTG A 

AGTAGTTGAT AG CCAAGGTGGTAAT CGTATGCTG G AA.CAATCAAGTTGGG 

GCGTG ACAGCTG AAG GAG CAAT(2AAG CCTG ATGTAACAG CTTCTGG CTTT 

GAAATTTATTCTTCAACCTATAATAATCAATACCAAACAATGTCTGGTAC 

AAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAAGTC 

ATTTGGCrGAGAAATATAAAGGGaTGAATTTAGATTCTAAAAAATTGCTA 

GAATTGTCTAAAAACATC CT CATGAGCTCAGCAACAG CATTATAT AGTGA 

AGAGGATAAGGCGTTTTATTCACC^CGTCAGCAAGGTGC^GGTGTAGTTG 

ATG CTGAAAAAG CTATC CAAGCT C aATATTAT ATT ACTGGAAACGAT GG C 

AAAGCTAAAATTAATCTCAAACGAATGGGAGATAAATTTGATATCACAGT 

TACAATT CAT aAACTTGTAGAAGGTGTCAAAG AA t TGTATT ATCAAG CTA 

ATGTAGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCITaAACCACAA 

GCCTTGCTAGATACTAA.TTGGCAGAAAGTAATTCTTCGTGATAAAGAAAC 

ACAAGTTCGATTTACTATTGATGCTAGTCAATTTAGTCAGAAATTAAAA.G 

AACAGATGG CAAATGGTTATTT CTTAGAAGGTTTTGTACGTTTTAAAGA 

GC CAAGGATAGT AAT CAGG AGTTAATG AGT ATTCCTTTTGTAGG ATTTAA 

TGGTGATTTTGCGAACT^ACA^GCACTTGAAACACCGATTT 

TTTCTAAAGGTAGTTTCTACTATAAACCAAATGATACAACTCATAAAGAC 

CAATTGG AGTACAATGAAT CAG CT C CTTTTGAAAG CAACAACTAT ACTG C 

CTTGTTAACACAAT CAG CGT CTTGG GGCTATGTTG ATTATGT CAAAAATG 

GTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAATTATTTTAGGA 

ACTTTTGAGAATAAGGTTGAGGATAAAACAATTCATCriTTTGG 

TGCAG CG AATAATC CAT ATTTTG C CATTT CTCCAAA1AAAGATG GAAATA 

GGGACGAAATCACTCCCCAGGCAA.CTTTCTTAAGAAATGTTAAGGATATT 

TCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAA7VGTAAGGT 

TTTAC CATCTTATCGTAZ^AAATTTCCAT AATAAT C CAAAG CAAAGTGATG 

GT CATT ATCGTATGGATG CT CTTCAGTGGAGTGGTTTAGAT AAGG ATGG C 

AAAGTTGTAGCAGATGGTTTTTATACTTATCGCTTACGTTACACA.CCAGT 

AGCAGAAGGAGCAAATAGTCAGGAGTCAGACTTTAAAGTACAAGTAAGTA 

CTAAGTCACCAA^TCTTCCTTCACGAGCTCAGTTTGATGAAACTAATCGA 

ACATTAAGCTT AGC CATG CCTAAGG AAAGTAGTT ATGTT C CTACAT ATCG 

TTTACAATTAGTTTTATCTO\TGTTGTAAAAGATGAAGAATATGGGGATG 

AG ACTT CTTAC CATT ATTT C CATAT AG ATGAAGAAGGTAAAGTGACACTT 

CCTAAAACGGTTAAGATAGGAGAG AGTGAG GTTGCGG TAG ACCCT AAGG C 

CTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCAaCGGTAAAAT 

TGTCTGATCrCTTGAATAAGGCAGTAGTATCAGAGAAAGAAAACGCTATA 

GTAATTT CT aACAGTTTCAAAT ATTTTGATAACTTGAAAAAAGAAC CTAT 

GTTTATTTGTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGAAGAAATAA 

TATTAGTTAAGCCGCA\ACTAa^GTTACTACTCAATCATTGTCTAAAGAA 

ATAACTAAAT CAGG AAA1X3AGAAAGTCCT CACTT CTACAAAGAAT AATAG 

TAG CAGAGTAG CTAAGATCAT ATCACCTAAA.CAT AACGGGGATT CTGTT A 

ACCATACC 

PRETTY of : /biotmp/msal83564 . 2 { * } May 13, 2003 03:28 .. 

1 50 

msal83564 . 2{l47j20Hl) 

msal83564 .2(147_M732} 

msal83564 . 2 { 147__M78 1 } 

msal83564 . 2{147_2603 } gtggataaac atcactcaaa aaaggctatt ttaaagttaa cacttataac 

msal83564 ,2{147_JM9130013} 

msal83564.2{l47_18RS21} 

msal83564 .2{147_090} 

msal83564.2{l47_CJB110} 

msal83564.2(l47_A909) 

msal83564.2{147_H36B} 

msal83564 .2 {147_1169NT} T 

Consensus ********** ********** ********** ********** ********** 

51 100 

msal83564 .2{147_C0H1} GAGGAGCAAG 

msal83564 .2{147_M732} GAGGAGCAAG 

msal83564 .2(l47_M78l} GAGGAGCAAG 

msal83564 .2{147_2603} aactagtatt ttattaatgc atagcaatca agtgaatgca GAGGAGCAAG 

msal83564 .2(14 7_JM9130 013} GAGGAGCAAG 

msal83564.2{!47_18RS2l} GAGGAGCAAG 

msal83564.2{l47_090j GAGGAGCAAG 

msal83564.2{l47_OJB110} - GAGGAGCAAG 
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msal83564.2f 147_A909} GAGGAGCAAG 

msal83564 ; 2{l47_H36B} GAGGAGCAAG 

msal83 564.2{l47_1169NT} GAGGAGCAAG 

Consensus ********** ********** ********** ********** ********** 



101 

msal83564.2{l47 COHl} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47~M732} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47_M78lj AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47_2603} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47_JM9130013} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83 564.2{l47_18RS2l} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msa!83 564 .2{l47_090) AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564 . 2 { 147_CJB110 } AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47_A909} AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564.2{l47_H36Bj AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

msal83564 . 2 {l47_1169NTj AATTAAAAAA CCAAGAGCAA TCACCTGTAA 

Consensus ********** ********** ********** 



TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
TTGCTAATGT 
********** 



150 

TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
TGCTCAACAG 
********** 



msal83564.2{147_COHl} CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC 

msal83564.2{l47_M732} CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC 

msal83564.2{l47_M78l} CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC 

msal83564.2{l47_2603> CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC 

msal83564 . 2 { 147_JM9130013 } CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC 

msal83564 . 2 { 147_18RS2l} CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC 

msal83564.2{l47_090} CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC 

msal83564.2{l47_CJB110} CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAnCAgC 

msal83564.2{l47_A909} CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAtC 

msal83564.2{l47_H36B} CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAtC 

msal83564 .2(14 7_1 1 6 9 NT } CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC 

Consensus ********** ********** *****_**** ********** *****_**„* 

201 250 

msal83564.2{l47_COHl} TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564.2{l47_M732| TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564.2{l47__M781> TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83 564. 2 { 147^2603} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

TOsal83564.2{l47_JM9130013} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564.2{l47_18RS2l} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564 .2{147_090} TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msa!83564 .2{l47_CJB110} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564 .2|147_A909} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564.2{147_H36B} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

msal83564.2{l47_1169NT} TGCTTCTGCT AGTAATACAG CGAAAGAAAT GGGTGATACA TCTGTAAAAA 

Consensus ********** ********** ********** ********** ********** 



251 300 

msal83564.2{147_COHl} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_M732} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{!47_M78l} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_2603} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_JM9130013} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_18KS2l} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_090} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_CJB110} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_A909} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_H36B} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

msal83564.2{l47_1169NT} ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT 

Consensus ********** ********** ********** ********** ********** 



, 301 350 

msal83564.2f l4 7_COHl> ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47_M732} ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83 564 .2(14 7_M781] ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47_2603j ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564. 2 {l47_JM9 130013 } ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47_18RS2l} ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47_090} ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564 ,2{147_CJB11G} ACGTCTAATw TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47__A909} ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564.2{l47_H36B} ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

msal83564 .2{147_11S9NT} ACGTCTAATa TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA 

Consensus *********_ ********** ********** ********** ********** 



351 400 

msal83.564.2{l47_COHl} GACAACCAAC AAT AA a G AAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564 .2fl47_M732 j GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564.2{147_M78l} GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564.2{l47_2603} GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564.2{l47_JM9130013} GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564 . 2{l47_18RS2l} GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 

msal83564.2{l47__090} GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 
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msal83564 . 2 { 147_CJB110 } 
msal83564.2{l47_A909} 
msal83564 . 2{ 147^H36B} 

msal83564 . 2 { 147_1169NT} 
Consensus 



msal83564 . 2 { 147_COHl } 
msal83564 . 2 { 147_M732 } 
msal83564.2{147_M78l} 
msal83564.2{l47_2603} 
msa!83564 . 2 {l47_JM9130013 } 
msal83564 . 2 { 147_18RS21 } 
msa!83564 . 2 { 147_090 } 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2 { 147_A909 } 
msal83564 . 2 { 147_H36B } 
msal83564 . 2 { 147_1169NT} 
Consensus 



msal 8 3 5 64 . 2 { 14 7_COHl } 
msal83564 . 2 { 147_M732 } 
msal83564 .2{l47_M78l} 
msal83564 .2{147_2603} 
msal83S64 . 2 { 147_JM9130013 } 
msal83564 .2 {l47_18RS2l} 
msal83564.2{l47_090} 
msal83564 .2{l47_CJB110} 
msal83564 . 2 { 147_A909} 
msal83564.2{l4 7_H36B} 
msal83 564 . 2 { 147_1169NT} 
Consensus 



msal83564.2{!4 7_COHl } 
msal83564 ,2 {147_M732> 
msal83564 .2{147_M781} 
msal83564 . 2 { 147_2603 } 
msal83564.2{l47_JM9130013} 
msal83564 . 2 {l47_18RS2l} 
msal83564. 2 { 147^090} 
msa!83564 . 2 { 147_CJB110 } 
msal83564 .2{147_A909} 
msal83564 - 2 { 147_H36B} 
msal83564 . 2 { 147_1169NT} 
Consensus 



msal8 3 564 . 2 { 14 7_COHl } 
msal83564 .2f 147_M732j 
msal83564 . 2 { 147J4781} 
msal83564.2{l47 2603} 
msal83564 . 2 { 147_JM9 130013 } 
msal83S64 .2{147_18RS21} 
msal83564 .2{147_090} 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2 { 147_A909 } 
msal83564.2{l47_H36B) 
msal83564.2{l47_1169NT) 
Consensus 



msal83564 .2{147_C0H1} 
msal83564 .2{147_M732} 
msal83564.2jl47_M78l} 
msal83564 .2{147_2603} 
msal83564 . 2 { 147_JM9130013 } 
msal83564.2{l47_18RS21} 
msa!83 564 . 2 { 147_090 } 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2 { 147_A909} 
msal83564.2{l47_H36B} 
msal83 564 . 2 { 147_1169NT} 
Consensus 



msa!83564 .2{l47_COHl 
msal83564 . 2 { 147_M732 
msal83564 . 2 { 147_M781 
msal83564.2{!47_2603 
msal83564.2{l47_JM9130013 
■ msal83564.2{l47_18RS21 



GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 
GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 
GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 
GACAACCAAC AATAAgGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA 
********** *****_**** ********** ********** ********** 



AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCgTATGAAG 
AGTTCCCTCA GCgTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 
AGTTCCCTCA GCaTATGAAG 



450 

AGGTGAAGtC AgAAAGCAAG 
AGGTGAAGtC AgAAAGCAAG 
AGGTGAAGtC AgAAAGCAAG 
AGGTGAAGc C AgAAAGCAAG 
AGGTGAAGcC AgAAAGCAAG 
AGGTGAAGc C AgAAAGCAAG 
AGGTGAAGcC AgAAAGCAAG 
AGGTGAAGcC AgAAAGCAAG 
AGGTGAAGcC AgAAAGCAAG 
AGGTGAAGcC AgAAAGCAAG 
AGGTGAAGcC AaAAAGCAAG 



401 

TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
TAGCACAGAA 
********** 

451 

TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCgCTTG 
TCATCaCTTG 
TCATCaCTTG 
TCATCgCTTG 
*****_**** 



501 

AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
AACCCAAAGA 
********** 



********** 



**_******* 



********_* 



*_******** 



CTGTTcTTGA TACATCTAAA 
CTGTT cTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTtTTGA TACATCTAAA 
CTGTTtTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
CTGTTcTTGA TACATCTAAA 
*****_**** ********** 



GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
GGAAAGGGAA ATGTAGTAGC 
********** ********** 



500 

ATAACAAAAT TaCAAGCCAc 
ATAACAAAAT TaCAAGCCAc 
ATAACAAAAT TaCAAGCCAc 
ATAACAAAAT TaCAAGCCAt 
ATAACAAAAT TaCAAGCCAt 
ATAACAAAAT TaCAAGCCAt 
ATAACAAAAT TgCAAGCCAt 
ATAACAAAAT TgCAAGCCAt 
ATAACAAAAT TgCAAGCCAt 
ATAACAAAAT TgCAAGCCAt 
ATAACAAAAT TgCAAGCCAt 
********** *_*******_ 

550 

TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
TATTATTGAT ACTGGCTTTG 
********** ********** 



551 600 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC 
********** ********** ********** ********** ********** 



601 650 
AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA CTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAg aCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAgaCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAgaCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAagCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAagCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT 
AGCTTTAAAA aTAAggCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT 
********** _***_._**** ***„****** ********** ********** 



651 700 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 
CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG 



774 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msa!83564 .2{l47_090} 
msal83564 . 2 { 147_CJB110 } 
tnsal83564 .2(147 A909} 
msal83564.2{l47~H36B} 
msal83564 . 2 { 147_1169NT} 
Consensus 



CACTTATGGG 
CACTTATGGG 
CACTTATGGG 
CACTTATGGG 
CACTTATGGG 



AAATGGGTTA 
AAATGGGTTA 
AAATGGGTTA 
AAATGGGTTA 
AAATGGGTTA 



********** ********** 



ACGATAAGAT TGTTTTTGCA 
ACGATAAGAT TGTTTTTGCA 
ACGATAAGAT TGTTTTTGCA 
ACGATAAGAT TGTTTTTGCA 
ACGATAAGAT TGTTTTTGCA 
********** ********** 



CATAACTACG 
CATAACTACG 
CATAACTACG 
CATAACTACG 
CATAACTACG 
********** 



msal83564 . 

msal83564 . 

msa!83564 . 

msal83564 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 . 
msal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2{l47_M78l} 
2 {147 2603} 
JM9 130013} 
147_18RS2l] 
2{l47_090] 
147_CJB110 
2{147_A909] 
2{l47_H36B] 
147_1169NT] 
Consensus 



msa!83564 . 

msal83564 . 

msal83564 . 

msa!83564 . 
msal83564 .2{l47 
msal83564 .2{ 
msal83564 
msal83564 ,2{ 

msal83564 . 

msal83564 . 
msal83564 .2{ 



2 {147 COHl} 
2{147~M732} 
2{147 M781} 
2 {14722603} 
JM9130013} 
147_18RS2l} 
2{147_090} 
147_CJB110} 
2{l47_A909} 
2{147_H36B} 
147JL169NT} 
Consensus 



701 

CCAACAATAC 
C CAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
CCAACAATAC 
********** 

75'1 

TATGGgTCAG 
TATGGgTCAG 
TATGGgTCAG 
TATGGtTCAG 
TATGGtTCAG 
TATGGtTCAG 
TATGGgTCAG 
TATGGgTCAG 
TATGGgTCAG 
TATGGgTCAG 
TATGGtTCAG 
*****_**** ********** *****_**** 



AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
AGAAACGGTG 
********** 



AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 
AAGCAAAGAA 



GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
GCTGATATTG 
********** 



TATTT tGCAT 
TATTT t GCAT 
TATTT tGCAT 
TATTT CGCAT 
TATTT cGCAT 
TATTT cGCAT 
TATTT cGCAT 
TATTTcGCAT 
TATTT cGCAT 
TATTTcGCAT 
TATTTcGCAT 



CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 
CAGCAGCTAT 



750 

GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 
GAAAGATGGT 



********** ********** 



GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
GGTACACACG 
********** 



800 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT 
TTGCTGGTAT , 
TTGCTGGTAT 
TTGCTGGTAT 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47. 
msal83564.2{ 
msal83564 
tnsal83564.2{ 

msal83564 . 

msal83564 
msal83564.2{ 



2{147_C0H1} 
2 {147 M732} 
2(l47~M78l} 
2{147 2603} 
_JM9130013} 
147_18RS2l} 
,2{147_090} 
147__CJB110} 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



801 

TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 
TTTTGTAGGT 



AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 
AATAGTAAAC 



********** ********** 



GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
GTCCAGCAAT 
********** 



CAATaGTCTT 
CAATaGTCTT 
CAATaGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 
CAATgGTCTT 



8 50 

CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 
CTTTTAGAAG 



****_***** ********** 



851 900 

msal83564.2(l47 COHl) GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564.2{147~M732} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

tnsal83S64.2{l47 M78l} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564.2{l47~2603} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msalS 3564 .2{147__JM913 0013} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564.2{l47_18RS2l} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564 .2{147_090} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564.2{l47_CJB110} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal 83564 .2(147 A909} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msa!83564 .2{147^H36B} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

msal83564.2{l47_1169NT} GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT 

Consensus ********** ********** ********** ********** ********** 



msal83564 . 

msal83564 . 

rasal83564 . 

msal83564 . 
msal83564.2{l47 
msa!83 564.2 {' 
msal83564 
msal83564 .2{ 

msal83564 

msal83564 . 
msal83564.2{ 



2{l47_COHl) 
2{147_M732> 
2{147_M781) 
2(147 2603} 
JM9130013' 
147_18RS21^ 
2{147 090' 
147^x33110' 
2{147 A909' 
2{l47~H36B) 
147JL169NT} 
Consensus 



901 

GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 
GATTCGGACA 



AATTTGGaGA 
AATTTGGaGA 
AATTTGGaGA 
AATTTGGtGA 
AATTTGGtGA 
AATTTGGtGA 
AATTTGGaGA 
AATTTGGaGA 
AATTTGGtGA 
AATTTGGtGA 
AATTTGGaGA 



********** *******^** 



AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
AGCATATGCT 
********** 



AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
AAAGCAATCA 
********** 



950 

tAGACGCTGT 
tAGACGCTGT 
tAGACGCTGT 
CAGACGCTGT 
cAGACGCTGT 
CAGACGCTGT 
CAGACGCTGT 
cAGACGCTGT 
CAGACGCTGT 
cAGACGCTGT 
cAGACGCTGT 
_********* 



msal83564 . 2 { 147_COHl} 
msal83564.2{!47 M732} 
msal83564.2{l47~M78l) 
msal83564 . 2 { 147J2603 J 
msal83564 . 2 { 14 7__JM9130013 } 



951 

TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 



GCaAAAACGA TTAATATGAG ccTgGGAAAA 
GCaAAAACGA TTAATATGAG ccTgGGAAAA 
GCaAAAACGA TTAATATGAG ccTgGGAAAA 
GCaAAAACGA TTAATATGAG taTtGGAAAA 
GCaAAAACGA TTAATATGAG taTtGGAAAA 



1000 
ACgGCtGATT 
ACgGCtGATT 
ACgGCtGATT 
ACaGCtGATT 
ACaGCtGATT 



775 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564 . 2 { 147_18RS21 } 
msal83564 . 2 { 147_090 } 
msa!83564 . 2 { 147_CJB110 } 
msal83564 . 2{ 147_A909} 
msal83564 - 2{ 147_H36B } 
msal83 564 . 2 { 147_1169NT } 
Consensus 



TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 
TAATCTAGGA 



GCaAAAACGA 
GCaAAAACGA 
GCaAAAACGA 
GCaAAAACGA 
GCaAAAACGA 
GCtAAAACGA 



TTAATATGAG 
TTAATATGAG 
TTAATATGAG 
TTAATATGAG 
TTAATATGAG 
TTAATATGAG 



********** **_******* ********** 



taTtGGAAAA 
CCTtGGAAAA 
ccTtGGAAAA 
CCTtGGAAAA 
ccTtGGAAAA 
taTtGGAAAA 
- _* _ ****** 



ACaGCtGATT 
ACaGCaGATT 
ACaGCaGATT 
ACaGCaGATT 
ACaGCaGATT 
ACaGCtGATT 
**_**-**** 



msal83564.2{l47_COHl} 
msal83564 . 2 { 147_M732 } 
msal83564 . 2 { 147_M781 " 
msal83564 . 2 { 147_2603 
msal83564 . 2 { 147_JM9130013 
msal83564 . 2 { 147_18RS2l} 
msal83564.2{l47_090} 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2{ 147_A909} 
msal83564 . 2{ 147_H36B} 
msal83564 . 2 { 147_1169NT} 
Consensus 



1001 

CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
CTTTAATTGC 
********** 



tCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
aCTCAATGAT 
aCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
tCTCAATGAT 
_********* 



AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
AAAGTTAAAT 
********** 



TAG CACTTAA 
TAG CACTTAA 
TAG CACTT AA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
TAG CACTTAA 
********** 



1050 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
ATTAGCTTCT 
********** 



msal83564 . 2 { 147_COHl J 
msal83564.2{l47_M732} 
msal83564 . 2{ 147 _M78lj 
msal83564 . 2{ 147_2603 } 
msa!83564 . 2 { 147_JM9130013 } 
msal83564 . 2 { 147_18RS21 } 
msal83564 .2{147_090} 
msal83564 . 2 { 147_CJB110} 
msa!83564 . 2{ 147_A909 } 
msal83564 . 2{ 147_H36B} 
msal83564.2{l47_1169NT} 
Consensus 



1051 

GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 
GAGAAGGGCG 



TTG CAGTTGT 
TTGCAGTTGT 
TTG CAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 
TTGCAGTTGT 



********** ********** 



TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
TGTGGCTGCC 
********** 



GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
GGAAATGAAG 
********** 



1100 
GtGCATTTGG 
GtGCATTTGG 
GtGCATTTGG 
GcGCATTTGG 
GcGCATTTGG 
GcGCATTTGG 
GtGCATTTGG 
GtGCATTTGG 
GtGCATTTGG 
GtGCATTTGG 
GcGCATTTGG 
*_******** 



msal83564.2{!47_COHl} 
msal83564 . 2{ 147 JVI732 } 
msal83564 . 2 { 147_M78 1 } 
msal83564.2{l47_2603} 
msal83564 . 2 { 147_JM9130013 } 
msal83564 . 2 { 147„18RS21 J 
msal83564.2{l47_090} 
msal83564.2{l47_CJB110) 
msal83564.2{l47_A909} 
msal83564 .2{147_H36B} 
msal83564.2{l47_1169NT} 
Consensus 



1101 

TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 
TATGGATTAT 



AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCaT 
AGCAAACCgT 



TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 
TATCAACTAA 



********** ********_* ********** 



TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
TCCTGACTAC 
********** 



1150 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
GGTACGGTTA 
********** 



msal83564 . 

msal83564 . 

msal83564 

msal83564 . 
msal83564 .2(147 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 . 
msal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
2{147_2603 J 
JM9130013} 
147_18RS2l} 
2{147_090} 
147_CJB110 \ 
2{147_A909) 
2{l47_H36B} 
147_1169NT} 
Consensus 



1151 

ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
ATAGTCCAGC 
' ********** 



TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
TATTTCTGAA 
********** 



GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
GATACTTTGA 
********** 



GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
GTGTTGCTAG 
********** 



1200 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
CTATGAATCA 
********** 



msal83564 . 
msal83564. 
msal83564 ♦ 
msal83564 . 
msal83564.2{!47 

msal835S4.2{ 
msal83564 

msal83564.2{ 
msal83564 
msa!83564 

msal83564.2{ 



2{147_C0H1} 
2{147_M732} 
2{147_M781} 
2{147_2603} 
JM9130013) 
147_18RS2l} 
.2{l47_090; 
147_CJB110 
2{147_A909] 
2{147_H36B] 
147_1169NT] 
Consensus 



1201 

CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
CTTAAAACTA 
********** 



TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
TCAGTGAGGT 
********** 



CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
CGTTGAAACA 
********** 



ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
ACTATTGAAG 
********** 



1250 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
GTAAGTTAGT 
********** 



msal8 3 564.2(14 7_C0H1 } 
msal83564.2{l47_M732} 
msal83564 . 2 { 147_M78 1 ] 
msal83564.2{147_2603) 



1251 1300 
TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG 
TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG 
TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG 
TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG 



776 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564.2{l47_JM9130013} TAAGTTGCCG ATTGTGACTT 

msal83564.2{l47_18RS2l} TAAGTTGCCG ATTGTGACTT 

msal83564.2{l47_090} TAAGTTGCCG ATTGTGACTT 

msal83 564 . 2 {147 JTJB110 } TAAGTTGCCG ATTGTGACTT 

msal83564.2(l47_A909} TAAGTTGCCG ATTGTGACTT 

msal83564.2{147_H36B} TAAGTTGCCG ATTGTGACTT 

msa!83564 .2{l47_1169NT} TAAGTTGCCG ATTGTGACTT 

Consensus ********** ********** 



CTAAACCTTT 
CTAAACCTTT 
CTAAACCTTT 
CTAAACCTTT 
CTAAACCTTT 
CTAAACCTTT 
CTAAACCTTT 



TGACAAAGGT 
TGACAAAGGT 
TGACAAAGGT 
TGACAAAGGT 
TGACAAAGGT 
TGACAAAGGT 
TGACAAAGGT 



AAGGCCTACG 
AAGGCCTACG 
AAGGCCTACG 
AAGGCCTACG 
AAGGCCTACG 
AAGGCCTACG 
AAGGCCTACG 



********** ********** ********** 



msal83564 . 
msal83564 
msal83564 
msal83564 
msal83564 .2(147 

msal83564 .2{ 
msal83564 

msal83564 .2{ 
msa!83564 . 
msal83564 . 

msal83564 .2{ 



2{14 7_COHl} 
2{147_M732} 
2{147_M781} 
2{l47_2603~ 

JM9130013 
147 18RS21 
.2{147_090 
147_CJB110 
2{147_A909 
2{147_H36B} 
147_1169NT} 
Consensus 



1301 

ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
ATGTGGTTTA 
********** 



TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
TGCCAATTAT 
********** 



GGTGC . .AAA 
GGTGC . .AAA 
GGTGC. .AAA 
GGTGC . aAAA 
GGTGC. aAAA 
GGTGC. aAAA 
GGTGC. aAAA 
GGTGC. aAAA 
GGTGC a aAAA 
GGTGC. aAAA 
GGTGC. aAAA 
*****_.-★** 



AAAGAtTTTG 
AAAGAtTTTG 
AAAGAtTTTG 
AAAGAcTTTG 
AAAGAcTTTG 
AAAGAcTTTG 
AAAGAcTTTG 
AAAGAcTTTG 
AAAGACTTTG 
AAAGAcTTTG 
AAAGAcTTTG 
*****_**** 



1350 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA' 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
AAGGTAAGGA 
********** 



1351 1400 

msal83564 -2{l47_COHl} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

rnsal83564 . 2 { 147_M732 } CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564 -2{147_M781} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

rasal83564 .2{147_2603} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564 . 2 { 147_JM9130013 } CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564.2{l47_18RS2l} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564. 2 { 147^090} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTG AT TTTA 

' msal83564.2{l4 7_C JB 1 10} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564 ,2{l47_A909> CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

ms'a!83564 ,2{147_H36B} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

msal83564 .2{l47_1169NT} CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA 

Consensus ********** ********** ********** ********** ********** 



msal83564 . 

msal83564 . 

msa!83564 . 

msal83S64 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 . 
msal83564.2{ 



2(147_C0H1} 
2{147__M732} 
2{147_M781} 
2{147_2603} 
JM9130013} 
147__18RS2l} 
.2{147_090} 
147__CJB110} 
2f 147_A909} 
2{147__H36B} 
147__1169NT} 
Consensus 



1401 

TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
TGACTAAAAT 
********** 



CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
CACTCATGCT 
********** 



ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
ACAAATGCAG 
********** 



GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 
GTGTTGTTGG 



********** 



1450 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
TATCGTTATT 
********** 



, 1451 1500 

msal83564.2{l47_COHl} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564 .2{147_M732} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564 .2{147_M781> TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msa!83564 .2{147_2603} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564.2{l47__JM9130013} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564 .2{l47_18RS2l} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564 .2{l47_090} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564.2{l4 7_CJB1 10} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564 ,2{147_A909} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

ms a 1 8 3 5 6 4 . 2 { 1 4 7_H 3 6 B } TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

msal83564.2{l47__1169NT} TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT 

Consensus ********** ********** ********** ********** ********** 



msal83564 
msal83564 
msal83564 
msal83564 
msal83564.2{l47 

msal83564.2{ 
msal83564 

msal83564.2{ 
meal83564 . 
msal83564 . 

msal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2(147_M781} 
2{147__2603> 
_JM9130013} 
147__18RS21} 
2 { 147^090} 
147 CJBllOj 
2{147_A909) 
2{l47_H36B} 
147_1169NT} 
Consensus 



1501 

ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
ACCTGTGGGG 
********** 



gTTATTAGTA 
gTTATTAGTA 
gTTATTAGTA 
aTTATTAGTA 
aTTATTAGTA 
aTTATTAGTA 
gTTATTAGTA 
gTTATTAGTA 
gTTATTAGTA 
gTTATTAGTA 
gTTATTAGTA 



AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 
AAGTAGATGG 



CGAGCGTATA 
CGAG CGT AT A 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 
CGAGCGTATA 



_********* ********** ********** 



1550 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
AAAAATACTT 
********** 



tnsal83564 . 2 { 147_C0H1 } 
msal83564 .2f 147_M732} 
msal835(S4 .2{147_M78l} 



1551 1600 

CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT 

CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT 

CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT 



777 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564 . 2 { 147_2603 } 
msal83564 . 2 { 147_JM9130013 } 
msal83564.2(l47_18RS2l} 
msal83564 . 2 { 147_090 } 
msa!83 564 . 2 { 147_CJB110 } 
msal83564 .2{l47_A909} 
msal83564 .2{147_H36B} 
msal83564 . 2 { 147_1169NT} 
Consensus 



CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 
CAAGTCAGTT 



AACATTTAAC 
AACATTTAAC 
AACATTTAAC 
AACATTTAAC 
AACATTTAAC 
AACATTTAAC 
AACATTTAAC 
AACATTTAAC 



********** ********** 



CAGAG t TTTG 
CAGAGtTTTG 
CAGAG t TTTG 
CAGAGtTTTG 
CAGAGtTTTG 
CAGAGtTTTG 
CAGAGtTTTG 
CAGAG aTTTG 
*****-**** 



AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
AAGTAGTTGA 
********** 



TAGCCAAGGT 
TAG CCAAGGT 
TAGCCAAGGT 
TAGCCAAGGT 
TAGCCAAGGT 
TAGCCAAGGT 
TAGCCAAGGT 
TAGCCAAGGT 
********** 



msal83564 . 

rasal83564 . 

msal83564 . 

msal83564 
msal83564 .2(147 
msal83564.2{' 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 . 
msal83564.2{ 



2{l47_COHi; 
2{147_M732 ; 
2{147_M781" 
2{147_2603] 
_JM9130013' 
147_18RS2l| 
2{147_090] 
147__CJB110 } 
2(147_A909> 
2{147_H36B} 
147_1169NT} 
Consensus 



1601 

GGcAATCGTA 
GG cAATCGTA 
GGcAATCGTA 
GG t AATCGTA 
GGtAATCGTA 
GGt AATCGTA 
GGcAATCGTA 
GGcAATCGTA 
GGcAATCGTA 
GGCAATCGTA 
GGcAATCGTA 



TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 
TGCTGGAACA 



**_******* ********** 



AT CAAGTTGG 
AT CAAGTTGG 
AT CAAGTTGG 
ATCAAGTTGG 
AT CAAGTTGG 
ATCAAGTTGG 
ATCAAGTTGG 
ATCAAGTTGG 
ATCAAGTTGG 
ATCAAGTTGG 
ATCAAGTTGG 
********** 



GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 
GGCGTGACAG 



1650 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 
CTGAAGGAGC 



********** ********** 



msal83564 . 

msal83564 

msal83564 . 

msa!83564 . 
msa!83564.2{l47 
msa!83564.2{ 
msal83564 
msal83564 .2{ 

msal83564 . 

msal83564 
msal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
2{147_2603} 
_JM9130013} 
147_18RS2l} 
.2{147_090) 
147_CJB110) 
2{147_A909} 
2{147JH36B} 
147_1169NT} 
Consensus 



1651 

AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
AATCAAGCCT 
********** 



GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 
GATGTAACAG 



CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 
CTTCTGGCTT 



tGAAATTTAT 
t G AAATTT AT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
tGAAATTTAT 
cGAAATTTAT 



1700 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 
TCTTCAACCT 



********** ********** 



.********* ********** 



1701 1750 

msal83564 -2{l47_COHl} ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 . 2 { 147_M732 } ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 .2{147_M781} ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

ms al 8 3 5 64 . 2 { 147_2 603} ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564.2{l4 7_JM9 130013} ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564.2{l47_18RS2l} ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564.2{l4 7_0 9 0 } ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 .2 { 147_CJB110 j ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 -2{147_A909| ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 .2{147_H36B} ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

msal83564 -2{147_1169NT} ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT 

Consensus ********** ****_***** ********** ********** ********** 



msal83564 . 

rasal83564 . 

msal83564 . 

rasal83564 . 
0183183564.2(147, 
msal83564 .2{ 
msal83564 
msal83564 .2{ 

msal83564 

maal83564 
msal83564.2{ 



2{147_C0H1} 
2 (147_M732) 
2{147_M78l} 
2 {147^2603} 
JM9130013} 
147_18RS2l} 
2{147_090} 
147_CJB110} 
2{147_A909} 
2{l47_H36B) 
147_1169NT} 
Consensus 



1751 

GTTG CAGGAT 
GTTGCAGGAT 
GTTG CAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
GTTGCAGGAT 
********** 



TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
TAATGACAAT 
********** 



GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAaT 
GCTTCAAAgT 
GCTTCAAAgT 
GCTTCAAAgT 
********_* 



CATTTGG CTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
CATTTGGCTG 
******** ** 



1800 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
AGAAATATAA 
********** 



msal83564 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564 .2(147 
msal83564 .2{ 
msal83564 
msal83564 .2{ 

msal83564 . 

msa!83564 . 
msal83564 .2{ 



2 jl47_COHl} 
2{147_M732} 
2{147_M781> 
2(147_2603} 
_JM9130 013} 
147_18RS21} 
.2 (147_090 ) 
147_CJB110} 
2(147_A909} 
2(147_H36B} 
147_1169NT} 
Consensus 



1801 

AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
AGGGATGAAT 
********** 



TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
TTAGATTCTA 
********** 



AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
AAAAATTGCT 
********** 



AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
AGAATTGTCT 
********** 



1850 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
AAAAACATCC 
********** 



msal83564 . 2 j 147_C0H1 } 
msal83564 .2{147_M732} 



1851 1900 
TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT 
TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT 



778 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564 . 
msal83564 . 
msal83564.2{l47 
rasal83564.2{" 
msal83564 
msal83564.2{ 
msal83564 
msal83564 
msal83564.2{ 



2fl47_M78l} 
2{147_2603} 
_JM9130013} 
147_18RS2l} 
.2{147_090) 
147_CJB110) 
2{147_A909' 
2{147_H36B 
147_1169NT 
Consensus 



TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 
TCATGAGCTC 



AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 
AGCAACAGCA 



TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 
TTATATAGTG 



AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 
AAGAGGATAA 



GGCGTTTTAT 
GG CGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 
GGCGTTTTAT 



********** ********** ********** ********** ********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 . 
msa!83564.2{ 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
2{147_2603} 
JM9130013} 
147_18RS2l} 
.2{l47_090} 
147_CJB110} 
2{l47_A909} 
2{147_H36B} 
147 w _1169NT} 
Consensus 



1901 

TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 
TCACCACGTC 



AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 
AGCAAGGTGC 



********** ********** 



AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
AGGTGTAGTT 
********** 



GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 
GATGCTGAAA 



1950 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 
AAGCTATCCA 



********** ********** 



msal83564.2{l47_COHl} 
msal83564 .2{147_M732} 
msal83564 ~2{l47_M78l} 
rasal83564 . 2 { 147_2603 } 
msal83564 . 2 {147_JM9130013 } 
msal83564 . 2 { 147_18RS21 } 
msal83564 . 2 { 147_090 } 
msal83564 . 2 { 147__CJB110 ' 
msal83564 .2{l47_A909 
msal83564.2{l47__H36B 
msal83564 . 2 { 147_1169NT} 
Consensus 



1951 

AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 
AGCTCAATAT 



TATgTTACTG 
TATgTTACTG 
TATgTTACTG 
TATaTTACTG 
TATaTTACTG 
TATaTTACTG 
TATgTTACTG 
TATgTTACTG 
TATgTTACTG 
TATgTTACTG 
TATgTTACTG 



GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 
GAAACGATGG 



CAAAGtTAAA 
CAAAGtTAAA 
CAAAGtTAAA 
CAAAGcTAAA 
CAAAGcTAAA 
CAAAGcTAAA 
CAAAGCTAAA 
CAAAGCTAAA 
CAAAGcTAAA 
CAAAGCTAAA 
CAAAGcTAAA 



2000 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 



********** ***_****** ********** 



*****_**** ********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
rasal83564.2{ 

msal83564 

msal83564 
rasal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2(147_M781} 
2{ 147^2603} 
_JM9130013} 
147_18RS21} 
.2{147_090) 
147_CJB110} 
2{147__A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



2001 

AACGAgaGGG 
AACGAgaGGG 
AACGAgaGGG 
AACGAa t GGG 
AACGAatGGG 
AACGAa t GGG 
AACGAgtGGG 
AACGAg t GGG 
AACGAgtGGG 
AACGAgtGGG 
AACGAgtGGG 



AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 
AGATAAATTT 



GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 
GATATCACAG 



TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 
TTACAATTCA 



***★★__*** ********** ********** ********** 



2050 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
TAAACTTGTA 
********** 



msal83564 . 
msal83564 
msal83564 
msal83564 
msa!83564.2{l47 

msal83564 .2{ 
msal83564 

mBal83564.2{ 
msal83564 . 
msal83564 

mBal83564.2{ 



2(l47_COHl} 
2{147_M732} 
2{l47_M78l) 
2{147_2603} 
JM9130013* 
147_18RS2lj 
.2{147_090; 
147_CJB110; 
2{147_A909; 
2{147_H36B] 
147_1169NT} 
Consensus 



2051 

GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
GAAGGTGTCA 
********** 



AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
AAGAATTGTA 
********** 



TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
TTATCAAGCT 
********** 



AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
AATGTAGCAA 
********** 



2100 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
CAGAACAAGT 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83 564.2{ , 
msal83564 
msa!83564.2{ 

msal83564 

msal83564 
msal83564 .2{ 



2{147_C0H1} 
2{147_M732} 
2{147_M781} 
2{147_2603} 
JM9130013} 
147_18RS2lj 
2{147_090) 
147_CJB110 
2{147_A909 
2{147_H36B 
147_1169NT 
Consensus 



2101 

AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 
AAATAAAGGT 



AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 
AAATTTGCCC 



TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 
TTAAACCACA 



AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCQTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 
AGCCTTGCTA 



********** ********** ********** ********** 



2150 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
GATACTAATT 
********** 



msal83564.2{l47_COHl} 



2151 2200 
GGCAGAAAGT AATTCTTCGT GATAAAGAAA CACAAGTTCG ATT TACT ATT 



779 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564 . 2 { 147_M732 ) 
msal83564 . 2 { 147_M78 1 } 
msal83564 . 2 { 147_2603 } 
msal83564 . 2 { 147_JM9130013 } 
msa!8 3 5 64 . 2 { 14 7JL8RS2 1 } 
msal83564 . 2 { 147__090 } 
msal83564.2(l47_CJB110} 
msal83564 . 2 { 147_A909 } 
msal83564.2{l47_H36B) 
msal8 3 5 64 . 2 { 1 4 7_1 169NT } 
Consensus 



GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 



AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 



GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 



********** ********** ********** 



CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
CACAAGTTCG 
********** 



ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
ATTTACTATT 
********** 



msal83564 . 2 { 147_COHl } 
msal83564 . 2 { 147_M732 } 
msal83564.2fl47_M78l} 
msal83564 . 2 { 147_2603 } 
msal83564 . 2 { 147_JM9130013 } 
msal83564 . 2 { 147__18RS21 } 
msal83564.2{l47_090} 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2 { 147_A909 } 
msal83564 . 2 { 147_H36B} 
msal83564 .2{147_1169NT} 
Consensus 



2201 

GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GATgCTAGTC 
GAT t CTAGTC 
GAT t CTAGTC 
GATgCTAGTC 
***„****** 



AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
AATTTAGTCA 
********** 



GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
GAAATTAAAA 
********** 



GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
GAACAGATGG 
********** 



2250 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
CAAATGGTTA 
********** 



msal83564 
msal83564 
msal83564 
msal83564 
msal83564 .2(147 

msal83564 .2{ 
msal83564 

msal83564 .2{ 
msal83564 . 
msal83564 

msal83564.2{ 



2{l47_COHl) 
2{147_M732} 
2{147_M781} 
2{l47_2603 } 
_JM9130013} 
147_18RS2l} 
2{l47_090} 
147_CJB110} 
2{147_A909> 
2{147_H36B} 
147„1169NT} 
Consensus 



2251 

TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
********** 



GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
********** 



GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
GTTTTAAAGA 
********** 



AGCCAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCCAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGC t AAGGAT 



2300 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 1 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 



***_****** ********** 



2301 2350 

msal83564 . 2 { 147_C0H1 } AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2{l47_M732} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal 83564. 2(147 _M 781} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2{l47_2603} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2(l47_JM9130013} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2{l47_18RS2l} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564. 2 {147^090} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564 . 2 ( 147_CJB110 } AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2(l47_A909} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564 . 2(l47_H36B) AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAaCTTA 

msal83564.2{l47_1169NT} AGTTAATGAG TATTCCTTTT GTAGGATTTA ATGGTGATTT TGCGAgCTTA 

Consensus ********** ********** ********** ********** *****_**** 



msal83564 
msa!83564 
msal83564 
msal83564 
msal83564. 2(147 

msal83564.2{ 
msal83564 

tnsal83564.2( 
msal83564. 
msal83564 

msal83564.2{ 



7_M732} 
7_M781} 



2(147_C0H1} 
2 {147 

2{147_2603) 
_JM9130013} 
147_18RS21} 
,2{147_090} 
147_CJB110} 
2(147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



2351 

CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 



AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 



TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 



********** ********** ********** 



cTTTCTAAAG 
CTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
aTTTCTAAAG 
CTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
CTTTCTAAAG 
„********* 



2400 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564. 
msal83564. 2(147 
msal83564.2{ 
msal83564 
rnsal83564.2( 

msal83564 . 

msal83564 . 
msal83564.2( 



2fl47_C0HlJ 
2{147_M732} 
2{147_M781} 
2{147_2603} 
JM9130013} 
147_18RS21} 
2{147_090} 
147_CJB110} 
2f 147_A909j 
2{147_H36B) 
147_1169NT} 
Consensus 



2401 

CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
CTATAAACCA 
********** 



AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
AATGATACAA 
********** 



CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
CTCATAAAGA 
********** 



CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
CCAATTGGAG 
********** 



2450 
TAcAATGAAT 
TAcAATGAAT 
TAcAATGAAT 
TAcAATGAAT 
TAcAATGAAT 
TACAATGAAT 
TAcAATGAAT 
TACAATGAAT 
TAcAATGAAT 
TAcAATGAAT 
TAtAATGAAT 
**_******* 



2451 



2500 



780 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 

msal83564.2(l47_C0Hll CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564.2{l47_M732} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564 . 2 {l47_M78l} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564 .2{l47_2603} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564.2{l47_JM913 0013) CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564 .2 {147_18RS21} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564 .2{l47_090} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564.2{l47_CJB110} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal8 3 564 . 2 { 14 7_A9 09 } CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564.2{l47_H36B} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

msal83564.2{l47_1159NT} CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG 

Consensus ********** ********** ********** ********** ********** 



msal83564 . 

msal83564. 

msal83564 . 

msal83564 - 
sal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal83564 - 
msal83564 .2{ 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
2{147_2 603} 
_JM9130013} 
147_18RS2l} 
.2{l47_090} 
147_CJB110} 
2{147_A909} 
2{147_H36B) 
147_1169NT) 
Consensus 



2501 

TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
TCTTGGGGCT 
********** 



ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
ATGTTGATTA 
********** 



TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
TGTCAAAAAT 
********** 



GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
GGTGGGGAGT 
********** 



2550 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
TAGAATTAGC 
********** 



2551 

msal83564 .2 { 147_COHl} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_M732} ACCGGAGAGT CCAAAAAGAA 

msal83564 .2{l47_M78l} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_2603> ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_JM9130 013} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_18RS2lj ACCGGAGAGT CCAAAAAGAA 

msal83564. 2 { 147^090} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_CJB110} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47__A909} ACCGGAGAGT CCAAAAAGAA 

msal83564 .2{ 147_H36B} ACCGGAGAGT CCAAAAAGAA 

msal83564.2{l47_1169NT} ACCGGAGAGT CCAAAAAGAA 

Consensus ********** ********** 



TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 
TTATTTTAGG 



AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 
AACTTTTGAG 



2600 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 
AATAAGGTTG 



********** ********** ********** 



msal83564 . 
msal83564 
msal83564 
tnsal83564 . 
msal83564 .2(147 

msal83564 .2{ 
msal83564 

msal83564.2{ 
msal83564 . 
msal83564 . 

msal83564 -2{ 



2{l47_COHl) 
2{147_M732} 
2{147_M781} 
2{147_2603} 
_JM9130013} 
147_18RS2lj 
2{147_090} 
147_CJB110} 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



2601 

AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
AGGATAAAAC 
********** 



AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
AATTCATCTT 
********** 



TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
TTGGAAAGAG 
********** 



ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
ATGCAGCGAA 
********** 



2650 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
TAATCCATAT 
********** 



2651 

msal83564.2{l47_COHl} TTTGCCATTT 

msal83564.2{l47_M732} TTTGCCATTT 

msal83564.2fl47_M78l} TTTGCCATTT 

msal83564. 2 { 147^2603} TTTGCCATTT 

msal83564.2{l47_JM9130013} TTTGCCATTT 

msal83564.2{l47_18RS2l} TTTGCCATTT 

msal83564.2{l47__090) TTTGCCATTT 

msal83564.2{l47_CJB110} TTTGCCATTT 

msal83564 .2{147__A909} TTTGCCATTT 

msal83564.2{l47_H36Bj TTTGCCATTT 

msal83564.2{l47_1169NT) TTTGCCATTT 

Consensus ********** 



CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 
CTCCAAATAA 



AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 
AGATGGAAAT 



AGGGAcGAAA 
AGGGAcGAAA 
AGGGAcGAAA 
AGGGACGAAA 
AGGGAcGAAA 
AGGGAcGAAA 
AGGGAtGAAA 
AGGGAtGAAA 
AGGGAtGAAA 
AGGGAtGAAA 
AGGGAtGAAA 



********** ********** *****„**** 



2700 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
TCACTCCCCA 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564 .2(147 
msal83564.2(' 
msal83564 
msal83564.2{ 

msal83564 

msal83564 . 
msal83564 .2{ 



2{147_C0H1} 
2{147_M732| 
2{147_M781 ! 
2{147__2603' 

JM9130013] 
147_18RS2lj 

2{l47_090 
147_CJB110' 
2{l47_A909l 
2{l47_H36Bj 
147_1169NTJ 
Consensus 



2701 

GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
GGCAACTTTC 
********** 



TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
TTAAGAAATG 
********** 



TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 
TTAAGGATAT 



TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 
TTCTGCTCAA 



********** ********** 



2750 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
GTTCTAGATC 
********** 



781 



WO 2004/018646 PCT/US2003/026827 
Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564 . 2 { l47_COHl } 
msa!83564 . 2 { 147_M732 } 
msal83564 .2{l47_M78l} 
msal83564 . 2 { 147_2603 } 
msal83564 .2{l47_JM9130013 J 
msal83564 . 2 { 147_18RS21} 
msal83564 . 2 { 147^090} 
msal83564 . 2 {l47_CJB110 } 
msal83564 .2{147_A909} 
ms al 8 3 5 64 . 2 { 14 7__H3 6B } 
msal83564.2{l47_1169NT} 
Consensus 



msal83564 . 2 {l47_COHl } 
msal83S64 .2{147_M732} 
msalB3564 .2{l47_M78l} 
msal83564 .2{l47_2603} 
msal83564 . 2 {l47_JM9130013 } 
msal83564.2{l47_18RS21) 
msal83564 . 2 { 14 7_090 } 
msal83 564 . 2 { 147_CJB110 } 
msal83564 .2{l4 7_A909) 
msal83564.2{l47_H36B} 
msal83564.2{l47_1169NT} 
Consensus 



msal 8 3 5 64 . 2 { 14 7_COHl } 
msa!83564 .2{l47_M732} 
rasal83564 .2{l47_M78l} 
msal83564.2{l47_2603} 
msal83564 . 2 { 147_JM9130013 } 
msa!83564 .2{l47 18RS21} 
msal83564 . 2{ 147_090 } 
msal83564 .2 {l47_CJB110 } 
msal 8 3 564 . 2 {147_A909 } 
msal83564 .2{147_H36B} 
msal83564 .2{147_1169NT} 
Consensus 



msa!83564 - 2 {l4 7_COHl} 
msal83564 . 2 { 147_M732 } 
msal83564 . 2 {147_M781 } 
msal83564.2{l47_2603) 
msal83564.2{l4 7_JM91300!3} 
msal83564 . 2 { 147_18RS21 } 
msal83564 .2{l47_090} 
msal83564 . 2 { 147_CJB110 } 
msal83564.2{l47_A909} 
msal 8 3564 . 2 { 147_H36B} 
msal83564 .2{l47__1169NT} 
Consensus 



msal83564 . 2 { 147_COHl} 
msal83564.2{l47_M732} 
msal83564 . 2 { 147_M781 } 
msal83564 . 2 { 147_2603 } 
msal83564-2{l47_JM9130013} 
msal83564.2{l47 18RS21} 
msal83564.2{l47_090} 
msal83564 .2{147_CJB110} 
msal83564 .2(147_A909} 
msa!83564 .2{l47JH36B} 
msal83564 .2 { 147_1169NT> 
Consensus 



msal 8 3 564 . 2 { 147_COHl } 
msa!83564 - 2 { 147_M732 } 
msal83564.2{l47_M78l| 
msal83564 - 2 { 147_2603 } 
msal83564.2{l47_JM9130013} 
msal83564 .2 {147 18RS21} 
msal83564 .2{147_090) 
msal83564 . 2 { 147__CJB110) 
msal83564 .2{147_A909} 
msal83564 . 2 { 147_H36B} 
msal83564 . 2 { 147_1169NT} 
Consensus 



2751 2800 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TT AT CGT AAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA 
********** ********** ********** ********** ********** 



2801 2850 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTT C CAT A ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC 
AATTTCCATA ATAATCCAAA GCAgAGTGAT GGTCATTATC GTATGGATGC 
********** ********** ***_****** ********** ********** 



2851 2900 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
ctTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
CtTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT 
__******** ********** ********** ********** ********** 



2901 2950 
TTTATACTTA TCGctTACGT TACACAC CAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCG CCTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGccTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGttTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGttTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 
TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT 



2951 3000 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC 
********** ********** _********* ********** ********** 

3001 3050 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTtACtAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTtACtAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG' AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC 
**-**„**** ********** ********** ********** ********** 



782 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



3051 

msal83564.2{l47_COHl} CTAAGGaAAG TAGTTATGTT 

msal83564 . 2 { 147_M732 } CTAAGGaAAG TAGTTATGTT 

msal83564.2(l47_M78l) CTAAGGaAAG TAGTTATGTT 

msal 83564. 2 {147_2603} CTAAGGaAAG TAGTTATGTT 

msal83564.2{l47_JM9130013} CTAAGGaAAG TAGTTATGTT 

msa!83564.2{l47_18RS2l} CTAAGGaAAG TAGTTATGTT 

msal83 564. 2 {l47_090} CTAAGGaAAG TAGTTATGTT 

msal83 564 . 2 { 147_CJB110 } CTAAGGaAAG TAGTTATGTT 

msa!83564 .2{l47_A909} CTAAGGaAAG TAGTTATGTT 

msa!83564 .2{147_H36B} CTAAGGaAAG TAGTTATGTT 

msal83564.2{l47_1169NT} CTAAGGgAAG TAGTTATGTT 

Consensus ******-*** ********** 



CCTAcATATC 
CCTAcATATC 
CCTAcATATC 
CCTAcATATC 
CCTAcATATC 
CCTAcATATC 
CCTACATATC 
CCTACATATC 
CCTAcATATC 
CCTAcATATC 
CCTAtATATC 



GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTtTACAATT 
GTcTACAATT 
GTcTACAATT 
GTcTACAATT 



3100 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 
AGTTTTATCT 



****_***** **_******* ********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564 .2(147 
msal83564 .2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal835€4 - 
msa!83564 .2{ 



2{l47_COHl} 
2{147_M732) 
2{l47_M78l} 
2{l47_2603j 
JM9130013} 
147_18RS21} 
.2{l47_090.} 
147 CJB110} 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



3101 

CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
CATGTTGTAA 
********** 



AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
AAGATGAAGA 
********** 



ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGgGAT 
ATATGGaGAT 
ATATGGaGAT 
ATATGGaGAT 
******_*** 



GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
GAGACTTCTT 
********** 



3150 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
ACcATTATTT 
AC t ATTATTT 
**_******* 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564 .2(147 
msal83564.2{' 
msal83564 
msa!83564 .2{ 

msal83564 . 

msal83564 . 
msal83564.2{ 



2{l47_COHl} 
2{147__M732) 
2{l47_M781} 
2{147_2603} 
JM9130013} 
147_18RS2lJ 
.2{147_090> 
147_CJB110} 
2{l47_A909j 
2{147_H36B> 
147_1169NT) 
Consensus 



3151 

CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
CCATATAGAT 
********** 



CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
CgAGAAGGTA 
CaAGAAGGTA 
CaAGAAGGTA 
*_******** 



AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGtGACACT 
AAGcGACACT 
***_****** 



TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACg 
TCCTAAAACa 
TCCTAAAACa 
TCCTAAAACg 
*********_ 



3200 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
GTTAAGATAG 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 . 

msal 8 3 564 
msal83564 .2 { 



2{l47_COHl} 
2{147_M732' 
2ll47_M78i; 
2{147__2603 
_JM9130013 
147_18RS21" 
.2{147_090 
147_CJB110 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



3201 

GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
GAGAGAGTGA 
********** 



GGTTGCgGTA 
GGTTGCgGTA 
GGTTGCgGTA 
GGTTGCgGTA 
GGTTGCgGTA 
GGTTGCgGTA 
GGTTGCaGTA 
GGTTGCaGTA 
GGTTGCaGTA 
GGTTGCaGTA 
GGTTGCaGTA 



GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGg 
GACCCTAAGa 
GACCCTAAGa 
GACCCTAAGg 



CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 
CCTTGACACT 



3250 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 
TGTTGTGGAA 



******_*** *********_ ********** ********** 



3251 3300 

msal83564 .2{l47_COHlJ GATAAAGCTG GTAATTT t GC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83564 .2(147_M732} GATAAAGCTG GTAATTT tGC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83564.2{l47_M78l} GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83 564.2{l47_2603} GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA 

ms al 8 3 5 64 . 2 { 14 7_JM9 130 013} GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA 

msa!83564 . 2 { 147_18RS21) GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA 

msal83564 .2{147_090} GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83 564 . 2 { 147_CJB110 ) GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83564 .2{147_A909 j GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAc C TCTTGAATAA 

msal83564 .2{l47_H36B} GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA 

msal83564.2{l47_1169NT} GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA 

Consensus ********** *******_** ********** ********„* ********** 



3301 3350 

msal83564 .2{l47_COHll GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564.2 jl47_M732) GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564 .2{147_M781} GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564.2{ 14 7_2 603} GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal 8 3 5 64 . 2 { 147_JM9 130013} GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564.2{l47_18RS2l) GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564.2{l47_090) GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83 564 . 2 { 147_CJB110 J GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 

msal83564 .2{147_A909} GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAaTTTCA 

msal83564 . 2 { 147__H36B } GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAaTTTCA 

msal 83 5 64 . 2 { 147_1169NT} GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA 



783 



WO 2004/018646 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 

Consensus ********** ********** ********** ********** ****_***** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
tnsal83564 .2{l47 
msal83S64.2{ 
msal83564 
msal83564.2{ 

msal83564 

msal83564 
msal83564.2{ 



2{l47_COHl} 
2(147_M732) 
2{l47_M78l} 
2{147_2603} 
JM9130013' 
147 18RS21; 
.2{l47_090; 
147J2JB110] 
2{147_A909] 
2{l47 < _H36Bj 
147_1169NT] 
Consensus 



3351 

AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
AATATTTTGA 
********** 



TAACTTGAAg 
TAACTTGAAg 
TAACTTGAAg 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
TAACTTGAAa 
*********_ 



AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAt CTA 
AAAGAAtCTA 
AAAGAAcCTA 
AAAGAAcCTA 
AAAGAAcCTA 
******^*** 



TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
TGTTTATTTC 
********** 



3400 
TAAAgAAGgA 
TAAAgAAGgA 
TAAAgAAGgA 
TAAAaAAGaA 
TAAAaAAGaA 
TAAAaAAGaA 
TAAAgAAGgA 
TAAAgAAGgA 
TAAAgAAGgA 
TAAAgAAGgA 
TAAAaAAGaA 
****_***_* 



msa!83564 . 

msal83564 . 

msal83564 . 

msal83S64 . 
msal83564.2{l47 
msal83564 .2{" 
msal83564 
msal83564 .2 { 

msal83564 . 

msal83564 . 
msal83564.2{ 



2{l47_COHl} 
2{147_M732} 
2{14 7_M781) 
2{147_2603} 
JM9130013} 
147_18RS2l} 
.2{l47__090 \ 
147_CJB110) 
2{147_A909} 
2{147_H36B} 
147__1169NT} 
Consensus 



3401 

AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
AAAGTAGTAA 
********** 



ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
ACAAGAATCT 
********** 



AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
AGAAGAAATA 
********** 



a cATT AGTT A 
acATTAGTTA 
a cATTAGTTA 
atATTAGTTA 
a tATTAGTTA 
atATTAGTTA 
acATTAGTTA 
acATTAGTTA 
g cATTAGTTA 
g cATTAGTTA 
atATTAGTTA 
_ _******** 



3450 
AGCCtCAaAC 
AGCCtCAaAC 
AGCCtCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAaAC 
AGCCgCAcAC 
****_★*_.** 



msal83564 .2{l47_COHl} 
msal83564 .2{l47_M732} 
msal83564 .2{147_M781} 
msa!83564 . 2 { 147_2603 } 
msa!83564.2{l47_JM9130013j 
msal83564.2{l47 18RS21) 
msal83564 . 2 { 147__090} 
msal83564 . 2 { 147_CJB110 } 
msal83564.2{l47_A909} 
msal83564 . 2 { 147_H36B} 
msal83564.2{l47__1169NT} 
Consensus 



3451 

TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
TACAGTTACT 
********** 



ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 
ACTCAATCAT 



TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 
TGTCTAAAGA 



********** ********** 



AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTaAA 
AATAACTcAA 
AATAACTcAA 
AATAACTaAA 
*******„* * 



3500 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
TCAGGAAATG 
********** 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msa!83564 . 

msal83564 . 
msal83564 .2{ 



2{147J20H1} 
2{147_M732} 
2{147_M781) 
2{147_2603} 
_JM9130013} 
147_18RS2l} 
2{l47_090} 
147_CJB110} 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



3501 

AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
AGAAAGTCCT 
********** 



CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
CACTTCTACA 
********** 



AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
AACAATAATA 
********** 



GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGcAGAGT 
GTAGtAGAGT 
****_***** 



3550 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAgATC 
AGCTAAaATC 
******_*** 



msa!83564 . 

msal83564 . 

msal83564 . 

msal83564 
msal83S64.2{l47 
msal83564.2{" 
msal83564 
msal83564.2{ 

msa!83564 . 

msalB3564 . 
msal83564.2{ 



msa!83564 . 

msal83564 . 

msal83564 . 

msal83564 
msal83564.2{!47 
msal83564.2{ 
msal83564 
msal83564.2{ 

msal83564 

msa!83564 



3551 3600 

2{l47_COHl} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- • 

2{147_M732} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- 

2{147_M781} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC 

2{l47_2603j ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACCt tacctagtac 

_JM9130013} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- 

147_JL8RS21} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- . 

.2 {l47_090} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- 

147__CJB110} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- 

2{147_A909} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC- 

2{147_H36B} ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC 

147_1169NT} ATATCACCTA AACATAAtGG GGATTCTGTT AACCATACC - 

Consensus ********** *******_** ********** ********** ********** 

3601 3650 

2{147_C0H1} 

2{147_M732) ■ 

2{l47_M78l} 

2{147_2603} atcagataga gcaacgaatg gtctatttgt tggtactttg gcattgttat 

JM9130013} ~~ .. , 

14 7_28RS21} 

2{147_090} 

147_CJB110} - 

2(147_A909} 

2(147_H36B} 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 

msal83564 .2 {147_1169NT} 

Consensus ********** ********** ********** ********** ********** 

3651 3700 

msal83564 .2{l47_COHl} 

msal83564.2{147_M732} 

msal83564 .2{147_M7B1} 

msal83564 . 2 { 147__2603) ctagtttact tctttatttg aaacccaaaa agactaaaaa taatagtaaa 

msal83564.2{l47_JM9130013} 

msal83564 .2{l47_18RS2l} 

msal83564 .2{l47_090} 

msal83564.2{l47_CJB110} 

msal83564 .2fl47_A909} 

msal83564.2{l47_H36B} 

msal83564 .2 { 147_1169NT} 

Consensus ********** ********** ********** ********** ********** 

SEQ ID NO. 4412 
STRAIN 2603 

VDKHHSKKAI LKLTL ITTS I LLMHSNQVNAEEQELKNQEQSPVI ANVAQQPS PS VTTNTV 
EKTSVTAASASNTAKEMGDTSVKNDKTEDELLEELSKNLDTSNLGADLEEEYPSKPETTN 
NKESNVVTNASTAIAQKVPSAYEEVKPESKSSLAVLDTSKITKLQAITQRGKGNWAI I D 
TGFD I NHD I FRLDS PKDDKHS FKTKTE FEELKAKHN I TYGKWVNDKI VFAHNYANNTETV 
ADIAAAMKDGYGSEAKNISHGTHVAGI FVGNSKRPAI NGLLLEGAAPNAQVLLMRI PDKI 
DSD KFGEAY AKA I TD AVNLGAKT I NM S I G KTADS L I ALNDKVKLALKLASEKGVAVWAA 
GNEGAFGMDYSKPLSTNPDYGTVNS PAI SEDTLS VAS YESLKTI SEWETTI EGKLVKLP 
IVTSKPPDKGKAYDVVYANYGAKKDFEGKDFKGKIALIERGGGLDFMTKITHATNAGVVG 
IVIF^QEK^GNFLIPYRELPVGI ISKVDGERIKNTSSQLTFNQSFEVVDSQGGNRMLEQ 
SSWGVTAEGAIKPDVTASGFEIYSSTYNNQY<^SGTSMASPHVAGL^rIWI^SHIiAEKYK 
GMNLDSKKLLELSKN I LMS S ATAL YSEEDKAF YS PRQQGAGWDAEKAI QAQYY I TGNDG 
KAKI NLKRMGDKFD I TVT I HKLVEG VKELY YQANVATEQVNKGKFALKPCALLDTNWQKV 
ILFJ5KETQVRFTIDASQFSQKLKEQMANGYFLEGFVRFKEAKDSNQELMSIPFVGFNGDF 
ANLQALETPIYKTLSKGSFYYKPNDTTHKDQLEYNESAPFESNNYTALLTQSASWGYVDY 
VKNGGELELAPESPKRI I LGTFENKVEDKT I HLLERDAANNPYFAI S PNKDGNRDEITPQ 
AT FLRNVKD I SAQVLDQNGNVI WQSKVLPS YRKNFHNNPKQSDGHYRMDALQWSGLDKDG 
KVVADGFYTYRLRYTPVAEGANSQESDFKVQVSTKSPNLPSRAQFDETNRTLSLAMPKES 
SYVPTYRLQLVLSHVVKDEEYGDETSYHYFHIDQEGKVTLPKTVKIGESEVAVDPKALTL 
VVEDKAGNFATVKLSDLLNKAWSEKENAI VI SNSFKYFDNLKKEPMFI SKKEKWNKNL 
EEI ILVKPQTTVTTQSLSKEITKSGNEKVLTSTNNNSSRVAKI ISPKHNGDSVNHTLPST 
SDRATNGLFVGTLALLSSLLLYLKPKKTKNNSK 



SEQ ID NO. 4413 
STRAIN A909 

EEQELKNQEQSPVIANVAQQPSPSVTTOTVEKTSVTSASASNTAKEMGDTSVKm)KTEDE 
LLEELSKNLDTSNLGADDEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVBCPESK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFDINHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVAD I AAAM KDG YGS EARN I S HGTHVAG I FVG 
NSKRPA I NGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKA ITDAVNLGAKT I NMSLGK 
TADS L I AIjNDKVKLAL KLAS E KG VAVWAAGNEG AFGMDY S KPL S TN PD YGTVN S PA I S E 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKRL .R.G 
L.R. DCIN . AWWWT . FYD . NHSCYKCRCCWYRYF . RSRKTWKFSNSLP . ITCGGY . . SRW 
RAYKKYFKSVNI . PEF . SS . . PRWQS YAGT I KLGRDS . RSNQA . CNSFWL .NLFFNL . . S 
IPNNVWYKYGFTTCCRINDNASKSFG . EI . RDEFRF . KIARIV. KHPHELSNSI I . . RG . 
GVLFTTSARCRCS . C . KSYPSSILCYWKRWQS .N . SQTSGR. I . YHSYNS .TCRRCQRIV 
LSS . CSNRTSK. R . I CP . TTSLARY . LAESNSS . . RNTSS I YY . F . SI . S E I KRTDGKWL 
FLRRFCTF . RSQG . . SGVNEYSFCRI . W . FCELTST . NTDL . DAF . R . FLL . TK . YNS . R 
PIGVQ . ISSF . KQQLYCLVNTI SVLGLC . LCQKWWGVRISTGESKKNYFRNF .E.G. G.N 
NSSFGKRCSE . SI FCHFSK . RWK . G . NHSPGNFLKKC . GYFCSSSRSKWKCYLAK . GFTI 
LS . KFP . . SKAK . WSLSYGCPSVEWFR . GWQSCSRWFLYLSFTLHTSSRRSK . SGVRL . S 
SSKY . VTKSSFTSSV . . N . SNI KLSHA . GK . LCSYI SSTI SFISCCKR . RI WR . DFLPLF 
PYRSRR . SDTS . NS . DRRE . GCSRP . DLDTCCGR . SW . FRNGKIV. PLE . GSSIRERKRY 
SNF . QFQIF - . LEKRTYVYF . RRKSSKQESRRNSIS . AANYSYYSI IV . RNNSIRK. ESP 
HFYKQ . . . QSS . DHIT . T . RGFC . PY 

SEQ ID NO. 4414 
STRAIN H36B 

EEQELKNQEQS P VI ANVAQQPS PS VTTNTVEKTSVTS AS 

LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFDINHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYGSEAKNI S HGTHVAG I FVG 
NS KRPAINGLLLEGAAPNAQ VLLMR I PDKI DSDKFGEAYAKAI TDAVNLGAKT I NMS LGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTKI THATNAG WG IV I FNDQEKRGNFL I PYRELPVGVI SKVDG 
ERI KNTS SQLTFNQS FEWDSQGGNRMLEQS S WGVTAEGA I KPDVTASGFE I YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYYVTGNDGKAKI NLKRVGDKFD I TVT I HKLVEG VKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDKETQVRFTIDSSQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVG FNGD FANLQALETP I YKTLSKG S FYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRI I LGTFENKVEDKT 
I HLLE RDAANNP YFA I S PNKDGNRDE I TPQAT FLRNVKD I S AQVLDQNGNV IWQS KVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHWKDEEYGDETSYHYF 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



HIDQEGKVTLPKTVKIGESEVAVDPKTLTLVVEDKAGNFATVKLSDIiLNKAVVSEKENAI 

VISNNFKYFDNLKKEPMFISKEGKVWKNLEEIALVKPQ 

TSTNNNSSRVAKI I SPKHNGDSVNHT 

SEQ XD NO. 4415 
STRAIN 18RS21 

EEQELKNQEQSPVIANVAQQPSPSVTTNTVEKTSVTAASASOTAKBMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTSKI TKLQAI TQRGKGNWAI I DTGFD I NHD I FRLDSPKDDKHSFKTKTEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVADIAAAMKDGYGSEAKN 
NSKRPAINGLLLEGAAPNAQVLLMRIPDKIDSDKFGEAYAKAITDAVNLGAKTINMSIGK 
TADSLI AI^KVKLALKLASEKGVAVVVAAGNEGAFGMDYSKPLSTNPDYGTVNS PAI SE 
DTLSVASYESLKTI SEWETTI EGKLVKLPIVTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI ALIERGGGLDFMTKI THATNAGWG I VI FNDQE KRGNFL I PYRELPVGI I SKVDG 
ERI KNTSSQLTFNQSFEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQTMSGTS^SPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYS PRQQGAGWDAEKAI QAQYYI TGNDGKAKI NLKRMGDKFD I TVT I HKLVEG VKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKV I LRD KETQVRFT I DASQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVGFNGD FANLQALETP I YKTI SKGS FYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRIILGTFENKVEDKT 
I HLLERDAANNP YFA I S PNKDGNRDE ITPQAT FLrRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHWKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLWEDKAGNFATVKLSDLLNKAVVSEKENAI 
VI SNSFKYFDNLKKEPMFI SKKEKWNKNLEE 1 1 L VKP QTTVTTQS L S KE I TKS GNE KVL 
TSTNNNSSRVAKI I SPKHNGDSVNHT 

SEQ XD NO. 4416 
STRAIN M7 32 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKSESK 
S SLAVLDTS KITKLQATTQRGKGNWAI IDTGFDINHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYGSEAKN I LHGTHVAGI FVG 
NSKRPAINSLLLEGAAPNAQVLLMRI PDKI DS DKFGEAYAKA 1 1 DAVNLGAKT I NMSLGK 
TADSLIALNDKVKLALKIASEKGVAWVAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDVVYANYGAKKI LKVRT 
LKVRLH . LSVWDLI L . LKSLMLQMQVLLVSLFLTI KKNVEI F . FLTVNYLWGLLVK . MA 
SV . KILQVS . HLTRVLK . L I AKVAI VC WNNQVGA . QLKEQSSLM . QLLALKF I LQP 1 1 IN 
TKQCLVQVWLHHMLQD . . QCFKVIWLRNIKG . I . ILKNC . NCLKTSS -AQQQHYI VKRIR 
RFIHHVSKVQV. LMLKKLSKLNIMLLETMAKLKLISNEREINLISQLQFINL . KVSKNCI 
I KLM . QQNK . IKVNLPLNHKPC . ILIGRK . FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVLYVLKKPRIVIRS . . VFLL . DLMVI LRTYKHLKHRF I RRFLKWST I NQM I QL I KT 
NWSTMNQLLLKATT ILPC . HNQRLGAMLIMSKMVGS . N . HRRVQKELF . ELLRI RLRI KQ 
FIFWKEMQRIIHILPFLQIKMEIGTKSLPRQLS .EMLRIFLLKF . I KMEMLFGKVRFYHL 
I VKI S 1 1 1 QSKVMVI I VWMLFSG W . I RMAKL . QMVF I L I AYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH . A. PCLRKWMFLHIVYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR - TLRP . HLLWKI KLVILQR . NCLTS . IRQ. YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . . TRI . KK . H . LSLKLQLLLNHCLKK . LNQEMRKSS 
LLQTI I VAE .LRSYHLNITGILLTI 

SEQ XD NO. 4417 
STRAIN COHl 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTVKEMGD^ 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKSESK 
SSLAVLDTSKITKLQATTQRGKGNWAI IDTGFDINHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYGSEAKN I LHGTHVAG I FVG 
NSKRPAINSLLLEGAAPNAQVLLMRI PDKI DS DKFGEAYAKA 1 1 DAVNLGAKT I NMSLGK 
TADSL I ALNDKVKLALKLAS EKGVAWVAAGNEGAFGMD YSKPLSTNPD YGTVNS PA I SE 
DTLS VAS YESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDVVYANYGAKKI LKVRT 
LKVRLH . LSVWDLIL . LKSLMLQMQVLLVSL FLT I KKNVE I F . FLTVNYLWGLLVK . MA 
SV . KILQVS . HLTRVLK. LI AKVAI VCWNNQVGA . QLKEQSSLM . QLLALKF I LQP 1 1 IN 
TKQCLVQVWLHHMLQD . . QCFKVIWLRNIKG , I . ILKNC .NCLKTSS .AQQQHYI VKRIR 
RF I HHVS KVQV . LMLKKLS KLN I MLLETMAKLKL I SNERE INL I S QLQF INL . KVS KNC I 
I KLM . QQNK . IKVNLPLNHKPC . ILIGRK. FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVLYVLKKPRIVIRS . . VFLL . DLMVI LRTYKHLKHRF I RRFLKWST I NQM I QL I KT 
NWSTMNQLLLKATT ILPC . HNQRLGAMLIMSKMVGS . N . HRRVQKELF . ELLRI RLRI KQ 
FIFWKEMQRIIHILPFLQIKMEIGTKSLPRQLS .EMLRIFLLKF. I KMEMLFGKVRFYHL 
IVKISII IQSKVMVI IVWMLFSGW . I RMAKL . QMVF I L I AYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH .A . PCLRKWMFLH I VYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR . TLRP . HLLWKI KLVILQR . NCLTS . IRQ. YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . . TRI . KK . H . LSLKLQLLLNHCLKK . LNQEMRKSS 
LLQTI I VAE . LRSYHLNITGILLTI 

SEQ XD NO. 4418 
STRAIN M7 81 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNWTNASTAIAQKVPSAYEEVKSESK 
SSLAVLDTSKITKLQATTQRGKGNWAI IDTGFDINHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGSEAKN I LHGTHVAG I FVG 
NSKRPAINSLLLEGAAPNAQVLLMRI PDKIDSDKFGEAYAKAI I DAVNLGAKT I NMSLGK 
TADSL I AIjNDKVKLALKLASEKGVAVWAAGNEGAFGMD YSKPLSTNPD YGTVNS PA I SE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDVVYANYGAKKI LKVRT 
LKVRLH . LS VWDL I L . LKSLMLQMQVLLVSL FLT I KKNVEI F . FLTVNYLWGLLVK . MA 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



SV . KILQVS . HLTRVLK. LIAKVAI VCWNNQVGA . QLKEQSSLM . QLLALKFILQPI I IN 
TKQC L VQ VWLHHML QD . . QCFKVIWLRNIKG. I . ILKNC .NCLKTSS . AQQQHYI VKRIR 
RFIHHVSKVQV.I^LKKLSKI^IMLLETMAKLKIjISNEREINIiISQLQFINL . KVSKNCI 
IKLM . QQNK . IKVNLPLNHKPC . ILIGRK. FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVL YVLKKPR IVIRS . . VFLL . DLMVI LRTYKHL.KHRF I RRFLKWST I NQM IQLIKT 
NWSTMNQIiLLKATTILPC.HNQRLGAMLIMSKMVGS . N . HRRVQKEL F . ELLRIRLRIKQ 
FI FWKEMQRI IHILPFLQIKMEIGTKSLPRQLS . EMLRI FLLKF . I KMEMLFGKVRFYHL 
IVKISI I IQSKVMVI IVWMLFSGW . IRMAKL .QMVFILIAYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH. A. PCLRKWMFLHIVYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR . TLRP . HLLWKI KLVILQR . NCLTS . I RQ . YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . TRI . KK . H . LSLKLQLLLNHCLKK. LNQEMRKSS 
LLQTIIVAE . LRSYHLNITGILLTI 

SEQ ID KO. 4419 
STRAIN JM9130013 

EEQELKNQEQSPVIANVAC^PSPSVTTNTVEKTSVTAASASNTAKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTS KITKLQAI TQRGKGNWAI I DTGFD I NHDI FRLDSPKDDKHSFKTKTEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVADI AAAMKDGYGSEAKNI SHGTHVAG I FVG 
NSKRPAI NGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKA I TDAVNLGAKT I NMS IGK 
TADSLIAI^KVKl^KI^SEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL IERGGGLDFMTK I THATNAG WG I VI FNDQEKRGNFIj I PYREIiPVGI ISKVDG 
ERI KNTS SQLTFNQSFE WDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYY I TGNDGKAKI NLKRMGDKFD I TVT I HKL VEGVKEL Y 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDK^QVRFTIDASQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETPI YKTLSKGSFYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRIII/5TFENKVEDKT 
I HLLERDAANNP YFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFmfNPKQSDGHYRMDALQWSGIJDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHVVKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFATVK^ 

VI SNSFKYFDNLKKEPMFI SKKEKWNKNLEE 1 1 LVKPQTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I S PKHNGDS VNHT 

SEQ ID NO. 4420 
STRAIN 090 

EEQELKNQEQSPVIANVAC^PSPSVTrNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVFDTSKITKLQAITQRGKGNVVAIIDTGFDINHDIFRIiDSPKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVADIAAAMKDGYGSEAKNI SHGTHVAGI FVG 
NSKRPAINGLLLEGAAPNAQVLLMRIPDKIDSDKFGEAYAKAITDAVNLGAKTINMSLGK 
TADSLIALNDKVKIJUIjKIASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTK I THATNAGWG I VI FNDQEKRGNFLI PYRELPVGVI SKVDG 
ERIKNTSSQLTFNQSFEWDSOGGNRMLEQSSWGVTAEGAIKPDVTASGFEIYSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQI^AGVVDAEKAIQAQYYVTGNDGKAKINLKRVGDKFDITVTIHKLVEGVKELY 
YQANVATEQVNKGKFALKPQALItDTNWQKVI LRDKETQVRFT I DASQFSQKLKEQMANG Y 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETPI YKTLSKGSFYYKPNDTTHKD 
QLEYNESAPFESNNYTAI^TQSASWGYVDYVKNGGELELAPESPKRIILGTFENKVEDKT 
I HLLERDAANNPYFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDAFQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPLLAQFDETNRTLSIJ^PKESSYVPTYRLQLVLSHWKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLWEDKAGNFATVKLSDLLNKAWSEKENAI 
VI SNS FKYFDNLKKESMF I SKEGKWNKNLEE ITLVKPQTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I SPKHNGDS VNHT 

SEQ ID NO. 4421 
STRAIN CJB110 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTAKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNWTNASTAIAQKVPSAYEEVKPESK 
SSLAVFDTSKITKLQAI TQRGKGNWAI I DTGFD I NHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGSEAKNI SHGTHVAGI FVG 
NS KRPAI NGLLLEGAAPNAQVLLMRI PDKI D SDKFGEAYAKAI TDAVNLGAKT I NMSLGK 
TADSLIAI^DKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTKI THATNAGWG I VI FNDQEKRGNFLI PYRELPVGVI SKVDG 
ERI KNTSSQLTFNQS FEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQ^SGTSMASPHVAGLMTMLQNHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYYVTGNDGKAKINLKRVGDKFDITVT I HKL VEGVKEL Y 
YQANVATEQVNKGKFALKPQALLDTNWQKVI LRDKETQVRFT I DASQFSQKLKEQMANG Y 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETP I YKTLSKGSFYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRIILGTFENKVEDKT 
I HLLERDAANNPYFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDAFQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPLLAQFDETNRTLSLAMPKESSWPTYR^QLVLSHWKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFATVKLSDLLNKAWSEKENAI 
VI SNSFKYFDNLKKESMFI SKEGKWNKNLEEI TLVKPQTTVTTQSLSKE I TKSGNEKVL 
TSTNNNSSRVAKI I SPKHNGDS VNHT 

SEQ ID NO. 4422 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



STRAIN 1169NT 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASl^AKEMGDTSVKNDKTEDE 
LLEELSmLDTSNMGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPKSK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFDINHDI PRLDSPKDDKHSFKNKAEFEE 
LKAKHN I TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYG SEAKN I SHGTHVAG I FVG 
NS KRPAI NGLLLEGAAPNAQ VLLMRI PDKI DSDKFGEAYAKAITDAVNLGAKTINMS IGK 
TADSL I ALNDKVKUUiKLASEKGVAVVVAAGNEGAFGMDYSKPLSTNPDYGTVNS paise 
DTLSVASYESLKTI SEWETTI EGKLVKLPI VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLD FMTKI THATNAG WG I VI FNDQEKRGNFL I PYRELPVGVI SKVDG 
ERIKNTSSQLTFNQRFEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLEIiSKNI LMS SATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYYVTGNDGKAKINLKRVGDKFD I TVT I HKL VEGVKEL Y 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDKETQVRFTIDASQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I P FVGFNGD FAS LQALETP I YKTLS KGS FYYKPNDTTHKD 
QLEYNE SAPFESNNYTALLTQSAS WGYVD YVKNGGELELAPE S PKRI I LGTFENKVEDKT 
IHLLERDAANNPYFAI S PNKDGNRDE ITPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGIjDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSLAMPKGSSYVPIYRLQLVLSHVVKDEEYGDETSYYYF 
H I DQEGKATLPKTVKI GESE VAVDPKALTLVVEDKAGNFATVKLS DLLNKAVVS EKENA I 
VI SNSFKYFDNliKKEPMFI SKKEKWNKNLEEI ILVKPHTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I S PKHNGD S VNHT 
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msa209368 .2{ 

msa209368 
msa209368 .2(147 
msa209368 
rasa209368 
msa2 09368 

msa209368 . 

msa209368 



1 50 

2{l47_C0Hl) EEQELKNQEQ SPVIANVAQQ 

2{147_M732) EEQELKNQEQ SPVIANVAQQ 

2{147_M781} EEQELKNQEQ SPVIANVAQQ 

147_18RS2l} EEQELKNQEQ SPVIANVAQQ 

2{l47_2603} vdkhhskkai lkltlittsi llmhsnqvna EEQELKNQEQ SPVIANVAQQ 

JM9130013 } EEQELKNQEQ SPVIANVAQQ 

.2 {l47_090} EEQELKNQEQ SPVIANVAQQ 

2{147_CJB110} EEQELKNQEQ SPVIANVAQQ 

2{147_1169NT} EEQELKNQEQ SPVIANVAQQ 

2{147_H36B} EEQELKNQEQ SPVIANVAQQ 

2{147_A909} EEQELKNQEQ SPVIANVAQQ 

Consensus ********** ********** ********** ********** ********** 
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msa2 09368 
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msa209368 .2{ 

msa209368 

msa2 09368 . 



2{147_C0H1} 
2{147_M732 \ 
2{147_M781} 
147_18RS2l} 
2{147_2603} 
_JM9130013> 
72{147_090} 
147_CJB110} 
147_1169NT} 
2{l47_H36Bj 
2{147_A909) 
Consensus 
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PSPSVTTNiV 
PSPSVTTNiV 
PSPSVTTNiV 
PSPSVTTNtV 
PSPSVTTNtV 
PSPSVTTNtV 
PSPSVTTNiV 
PSPSVTTNiV 
PSPSVTTNiV 
PSPSVTTNtV 
PSPSVTTNtV 
********_* 



EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTaASA 
EKTSVTsASA 
EKTSVTsASA 
******_*** 



SNTvKEMGDT 
SNTvKEMGDT 
SNTvKEMGDT 
SNTaKEMGDT 
SNTaKEMGDT 
SNTaKEMGDT 
SNTvKEMGDT 
SNTaKEMGDT 
SNTaKEMGDT 
SNTaKEMGDT 
SNTaKEMGDT 
***_****** 



SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
SVKNDKTEDE 
********** 
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LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
LLEELSKNLD 
********** 
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msa2 09368 . 

msa209368 . 
msa209368 .2{ 

msa209368 . 
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F 



2{l47_COHl) 
2{147_M732} 
2{l47_M78l} 
147_18RS21) 
2{147_2603> 
_JM9130013} 
.2{147_090) 
147_CJB110 j 
147_1169NT) 
2{147_H36B} 
2{147_A909} 
Consensus 
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TSN1GADLEE 
TSN1GADLEE 
TSN1GADLEE 
TSN1 GADLEE 
TSN1GADLEE 
TSNl GADLEE 
TSN1GADLEE 
TSNl GADLEE 
TSNmGADLEE 
TSN1GADLEE 
TSN1GADLEE 
********** 



EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 
EYPSKPETTN 



NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 
NKESNWTNA 



STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
STAIAQKVPS 
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AYEEVKS eSK 
AYEEVKseSK 
AYEEVKs eSK 
AYEEVKpeSK 
AYEEVKpeSK 
AYEEVKpeSK 
AYEEVKpeSK 
AYEEVKpeSK 
AYEEVKpkSK 
AYEEVKpeSK 
AYEEVKpeSK 
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2{l47_COHl 
2{147_M732 
2{l47_M781 
147_18RS21 
2{147_2603 
7_JM9130013' 
,2{147_090; 
147_CJB110; 
147_1169NT] 
2{147_H36B # 
2{147_A909} 
Consensus 
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SSLAV1DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
SSLAVf DTSK 
SSLAVf DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
SSLAV1DTSK 
*****_**** 



ITKLQAtTQR 
ITKLQAtTQR 
ITKLQAtTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
ITKLQAiTQR 
******„*** 



GKGNWAI I D 
GKGNWAIID 
GKGNWAI ID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
GKGNWAIID 
********** 



TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
TGFDINHDIF 
********** 



200 

RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
RLDSPKDDKH 
********** 



msa2 09368 .2{l47_COHl} 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msa209368 . 

msa2 09368 . 
msa209368 ,2{ 

msa209368 . 
msa209368 .2(147 
msa209368 
msa209368 .2 { 
msa209368 .2{ 

msa209368 . 

msa209368 . 



2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{147_2603} 
JM9130013} 
7*2{l47_090} 
147_CJB110} 
147_1169NT} 
2f 147_H36B} 
2{147_A909} 
Consensus 



SFKtKaEFEE 
SFKtKaEFEE 
SFKtKtEFEE 
SFKtKtEFEE 
SFKtKtEFEE 
SFKtKaEFEE 
SFKtKaEFEE 
SFKnKaEFEE 
SFKtKaEFEE 
SFKtKaEFEE 



LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 
LKAKHNITYG 



***„*_**** ********** 



KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
KWVNDKIVFA 
********** 



HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
HNYANNTETV 
********** 



ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
ADIAAAMKDG 
********** 



msa209368 . 2 { 147_COHl} 
msa209368 .2{l47_M732} 
msa209368 ,2{147 M78l} 
msa209368 . 2 { 147_18RS2l} 
msa209368 .2(147 2603} 
msa209368 . 2 { 147_JM9130013 } 
msa209368.2(l47_090} 
msa2093 68 - 2 { 147_CJB110 } 
msa209368 . 2 { 147_1169NT} 
msa209368 . 2 { 147_H36B} 
tnsa2 093 68 .2{147_A909} 
Consensus 



251 

YGSEAKNI1H 
YGSEAKNI1H 
YGSEAKNI1H 
YGSEAKNIsH 
YGSEAKNISH 
YGSEAKNIsH 
YGSEAKNIsH 
YGSEAKNIsH 
YGSEAKNIsH 
YGSEAKNIsH 
YGSEAKNIsH 



GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 
GTHVAG I FVG 



NSKRPAINsL 
NSKRPAINSL 
NSKRPAINsL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 
NSKRPAINgL 



LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 
LLEGAAPNAQ 



300 

VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 
VLLMRIPDKI 



********_* ********** 



********_* ********** ********** 



msa209368 . 

msa209368 . 

msa209368 
msa209368 ,2{ 

msa209368 
msa209368 .2(147 
msa209368 
msa209368 .2 { 
msa209368 .2{ 

msa209368 - 

msa209368 . 



2{147_C0H1) 
2{147 M732} 
2{147"M781} 
147_18RS2l} 
2{147_2603} 
JM9130013} 
2{147_090} 
147_CJB110} 
147_1169NT} 
2{147_H36B} 
2{147_A909} 
Consensus 



301 

DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
DSDKFGEAYA 
********** 



KAIiDAVNLG 
KAIiDAVNLG 
KAIiDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
KAItDAVNLG 
***_****** 



AKTINMS1GK 
AKTINMS1GK 
AKTINMS1GK 
AKTINMSiGK 
AKTINMSiGK 
AKTINMSiGK 
AKTINMSIGK 
AKTINMSIGK 
AKTINMSiGK 
AKTINMSIGK 
AKTINMSIGK 
*******_** 



TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
TADSLIALND 
********** 



350 

KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
KVKLALKLAS 
********** 



209368 
msa209368 
msa209368 
msa209368 .2 
msa209368 
msa209368 .2{14 
msa20936 
msa209368 .2 
msa209368 .2 
msa209368 
msa209368 



.2{l47_COHl} 
-2{147_M732} 
-2{147_M781} 
{147_18RS21} 
.2{147_2603} 
7_JM9130013} 
8 .2{147_090} 
{147_CJB110} 
{147__1169NT} 
.2{147_H36B} 
.2{147_A909} 
Consensus 



351 

EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
EKGVAVWAA 
********** 



GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
GNEGAFGMDY 
********** 



SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
SKPLSTNPDY 
********** 



GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 
GTVNSPAISE 



400 

DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 
DTLSVASYES 



********** ********** 



msa209368 . 

msa209368 . 

msa209368 . 
rasa209368 .2 { 

msa209368 . 
0133209368.2(147, 
msa209368 
msa209368 
msa209368 

msa209368 . 

msa209368 . 



2{147_C0H1} 
2{147_M732} 
2{147_M781] 
147_18RS2l] 
2{l47_2603^ 
JM9130013 ! 
,^ U ."2{147_090; 
.2(l47_CJB110j 
.2{147_1169NTJ 
2{147_H36B} 
2{l47J\909} 
Consensus 



401 

LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
LKTISEWET 
********** 



TIEGKLVKLP 
T I EGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
TIEGKLVKLP 
********** 



I VTS KPFDKG 
IVTSKPFDKG 
I VTS KPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
IVTSKPFDKG 
********** 



KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
KAYDWYANY 
********** 



450 

GAKKilkvrt 
GAKKilkvrt 
GAKKilkvrt 
GAKKdfegkd 
GAKKdfegkd 
GAKKdfegkd 
GAKKdfegkd 
GAKKdfegkd 
GAKKdfegkd 
GAKKdfegkd 
GAKKrl.r.g 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2{ 

rasa209368 . 
msa209368 .2(147 
msa209368 
msa209368 .2 
msa209368 . 2 

msa209368 

msa20936B 



2{l47_COHl) 
2{147_M732) 
2(147_M781} 
147_18RS2l} 
2{147_2603 j 
^9130013^ 
2{147_090; 
147_CJB110l 
147 1169NTJ 
2{147_H36B] 
2{147_A909] 
Consensus 



451 

lkvrlh. lsv 
lkvrlh. lsv 
lkvrlh. lsv 
f kgkialier 
fkgkialier 
f kgkialier 
fkgkialier 
fkgkialier 
fkgkialier 
fkgkialier 
l.r.dcin.a 



wdlil .Iks 
wdlil .Iks 
wdlil. Iks 
gggldf mtki 
gggldfmtki 
gggldfmtki 
gggldfmtki 
gggldfmtki 
gggldfmtki 
gggldfmtki 
wwwt . f yd . n 



lmlqmqvllv 
lmlqmqvllv 
lmlqmqvllv 
thatnagwg 
thatnagwg 
thatnagwg 
thatnagwg 
thatnagwg 
thatnagwg 
thatnagwg 
hscykcrccw 



slfltikknv 
slfltikknv 
slfltikknv 
ivif ndqekr 
ivifndqekr 
ivif ndqekr 
ivifndqekr 
ivifndqekr 
ivifndqekr 
ivifndqekr 
yryf . rsrkt 



500 

eiF.f ltvny 
eiF.f ltvny 
eiF.f ltvny 
gnFlipyrel 
gnFlipyrel 
gnFlipyrel 
gnFlipyrel 
gnFlipyrel 
gnFlipyrel 
gnFlipyrel 
wkFsnslp.i 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2{ 

msa209368 
msa209368.2{l47 
msa209368 
msa209368 .2{ 
msa209368 .2{ 

msa209368 . 

msa209368 . 



2f 147_COHl} 
2{147__M732} 
2{l47_M78l} 
147_18RS2l} 
2{147_2603) 
_JM9130013} 
.2{l47_090} 
147_CJB110} 
147_1169NT) 
2{147_H36B) 
2{147_A909} 
Consensus 



lwGllvk.ma 
lwGllvk.ma 
lwGllvk.ma 
pvGiiskvdg 
pvGiiskvdg 
pvGiiskvdg 
pvGviskvdg 
pvGviskvdg 
pvGviskvdg 
pvGviskvdg 
tcGgy. .srw 



sv.Kilqvs . 
sv.Kilqvs . 
sv.Kilqvs . 
eriKntssql 
eriKntssql 
eriKntssql 
eriKntssql 
eriKntssql 
eriKntssql 
eriKntssql 
rayKkyfksv 



hltrvlk.li 
hltrvlk.li 
hltrvlk.li 
tfnqsf ewd 
tfnqsf ewd 
tfnqsf ewd 
tfnqsf ewd 
tfnqsf ewd 
tfnqrf ewd 
tfnqsf ewd 
ni.pef ,ss. 



akvaivcwnti 
akvaivcwnti 
akvaivcwnn 
s qggnrml eq 
sqggnrmleq 
s qggnrml eq 
sqggnrmleq 
sqggnrmleq 
sqggnrmleq 
sqggnrmleq 
. prwqsyagt 



qvga.qlkeq 
qvga.qlkeq 
qvga.qlkeq 
sswgvtaega 
sswgvtaega 
sswgvtaega 
sswgvtaega 
sswgvtaega 
sswgvtaega 
sswgvtaega 
iklgrds . rs 



msa2093 68 . 

msa209368 . 

msa209368 . 
msa209368.2{ 

msa209368 . 
msa209368 .2{l47 
msa209368 
msa209368 .2{ 
msa209368 .2{ 

msa209368 . 

msa209368 . 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2 { 147^2603} 
JM9130013) 
2{147_090} 
147_CJB110} 
147_1169NT} 
2(l47_H36B} 
2{147_A909} 
Consensus 



551 

sslm. qllal 
sslm.qllal 
sslm. qllal 
ikpdvtasgf 
ikpdvtasgf 
ikpdvtasgf 
ikpdvtasgf 
ikpdvtasgf 
ikpdvtasgf 
ikpdvtasgf 
nqa.cnsfwl 



kfilqpiiin 
kfilqpiiin 
kfilqpiiin 
eiysstynnq 
eiysstynnq 
eiysstynnq 
eiysstynnq 
eiysstynnq 
eiysstynnq 
eiysstynnq 
.nlffnl. .s 



tkqclvqvwl 
tkqclvqvwl 
tkqclvqvwl 
yqtmsgtsma 
yqtmsgtsma 
yqtmsgtsma 
yqtmsgtsma 
yqtmsgtsma 
yqtmsgtsma 
yqtmsgtsma 
ipnnvwykyg 



hhmlqd . . qc 
hhmlqd . -qc 
hhmlqd. .qc 
sphvaglmtm 
sphvaglmtm 
sphvaglmtm 
sphvaglmtm 
sphvaglmtm 
sphvaglmtm 
sphvaglmtm 
f ttccrindn 



600 

f kviwlrnik 
fkviwlmik 
f kviwlrnik 
lqshlaekyk 
lqshlaekyk 
lqshlaekyk 
lqshlaekyk 
lqnhlaekyk 
lqshlaekyk 
lqshlaekyk 
asksfg. ei . 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2{ 

msa209368 
msa209368 .2(147 
msa209368 
msa209368 ,2 { 
msa209368 .2 { 

msa209368 . 

msa209368 . 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{147_2603} 
JM9130013} 
*2{147_090} 
14 7_CJB110} 
147_1169NT} 
2{147_H36B} 
2{147_A909} 
Consensus 



601 

g.i.ilknc. 
g.i . ilknc . 
g. i .ilknc . 
gmnldskkll 
gmnldskkll 
gmnldskkll 
gmnldskkll 
gmnldskkll 
gmnldskkll 
gmnldskkll 
rdef rf .kia 



nclktss .aq 
nclktss .aq 
nclktss . aq 
elsknilmss 
elsknilmss 
elsknilmss 
elsknilmss 
elsknilmss 
elsknilmss 
elsknilmss 
riv . khphel 



qqhyivkrir 
qqhyivkrir 
qqhyivkrir 
atalyseedk 
atalyseedk 
atalyseedk 
atalyseedk 
atalyseedk 
atalyseedk 
atalyseedk 
snsii . .rg. 



rfihhvskvq 
rf ihhvskvq 
rfihhvskvq 
afysprqqga 
afysprqqga 
afysprqqga 
afysprqqga 
afysprqqga 
afysprqqga 
afysprqqga 
gvlf ttsarc 



650 

v.lmlkKlsk 
v.lmlkKlsk 
v.lmlkKlsk 
gwdaeKaiq 
gwdaeKaiq 
gwdaeKaiq 
gwdaeKaiq 
gwdaeKaiq 
gwdaeKaiq 
gwdaeKaiq 
res .c .Ksyp 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2 { 

msa209368 
msa209368 .2(147 
msa209368 
msa209368 
msa209368 

msa209368 

msa209368 . 



3.2{ 
i.2{ 



2{147_C0H1} 
2{147_M732} 
2{147_M781} 
147_18RS21} 
2 { 147^2603} 
_JM9130013} 
.2{l47_090} 
147_CJB110} 
147_1169NT} 
2{147_H36B} 
2{147_A909} 
Consensus 



msa209368 . 

msa209368 

msa209368 . 
msa209368 .2 { 

msa209368 . 
msa209368 .2(147 
msa209368 
msa209368 .2{ 
msa209368 .2{ 

msa209368 . 

msa209368 . 



2{147_C0H1} 
2{147_M732} 
2{147_M7 81} 
147_18RS2l} 
2{147_2603} 
_JM9130013} 
,2{147_090} 
147_CJB110} 
147__1169NT} 
2{147_H3 6B} 
2{147_A909} 
Consensus 



651 

lnimlletma 
lnimlletma 
lnimlletma 
aqyyitgndg 
aqyyitgndg 
aqyyitgndg 
aqyyvtgndg 
aqyyvtgndg 
aqyyvtgndg 
aqyyvtgndg 
ssilcywkrw 



klklisnere 
klklisnere 
klklisnere 
kakinlkrmg 
kakinlkrmg 
kakinlkrmg 
kakinlkrvg 
kakinlkrvg 
kakinlkrvg 
kakinlkrvg 
qs .n. sqtsg 



inlisqlqf i 
inlisqlqfi 
inlisqlqf i 
dkfditvtih 
dkfditvtih 
dkfditvtih 
dkfditvtih 
dkfditvtih 
dkfditvtih 
dkfditvtih 
r . i .yhsyns 



nl .kvsknei 
nl . kvsknei 
nl . kvsknei 
klvegvkely 
klvegvkely 
klvegvkely 
klvegvkely 
klvegvkely 
klvegvkely 
klvegvkely 
. tcrrcqriv 



700 

iklm.qqnk. 
iklm.qqnk. 
iklm.qqnk. 
yqanvateqv 
yqanvateqv 
yqanvateqv 
yqanvateqv 
yqanvateqv 
yqanvateqv 
yqanvateqv 
lss .csnrts 



701 

ikvnlplnhk 
ikvnlplnhk 
ikvnlplnhk 
nkgkfalkpq 
nkgkfalkpq 
nkgkfalkpq 
nkgkfalkpq 
nkgkfalkpq 
nkgkfalkpq 
nkgkfalkpq 
k.r.icp.tt 



pc. iligrk. 
pc. iligrk. 
pc. iligrk. 
alldtnwqkv 
alldtnwqkv 
alldtnwqkv 
alldtnwqkv 
alldtnwqkv 
alldtnwqkv 
alldtnwqkv 
slary .laes 



ffvikkhkfd 
ffvikkhkfd 
ffvikkhkfd 
ilrdketqvr 
ilrdketqvr 
ilrdketqvr 
ilrdketqvr 
ilrdketqvr 
ilrdketqvr 
ilrdketqvr 
nss. .rntsB 



lllmlvnlvr 
lllmlvnlvr 
lllmlvnlvr 
f tidasqf sq 
f tidasqf sq 
f tidasqf sq 
f tidasqf sq 
f tidasqf sq 
f tidasqf sq 
f tidssqf sq 
iyy.f .si.s 



750 

n . Knrwqmvi 
n . Knrwqmvi 
n . Knrwqmvi 
klKeqmangy 
klKeqmangy 
klKeqmangy 
klKeqmangy 
klKeqmangy 
klKeqmangy 
klKeqmangy 
eiKrtdgkwl 



msa209368 
msa209368 
msa209368 
msa209368 .2 
msa209368 
msa209368 .2(14 
msa20936 
msa209368 .2 
msa209368 .2 
msa209368 
msa209368 



.2{l47_COHl} 
.2{147_M732} 
.2{147_M781} 
{147_18RS21} 
.2{147_2603} 
7_JM9130013) 
8 .2{147__090) 
{147_CJB110} 
{147_1169NT} 
.2{147_H36B} 
.2{147_A909} 
Consensus 



751 

s . kvlyvlkk 
s . kvlyvlkk 
s . kvlyvlkk 
f legfvrfke 
f legfvrfke 
f legfvrfke 
f legfvrfke 
f legfvrfke 
f legfvrfke 
f legfvrfke 
flrrfctf .r 



privirs . . v 
privirs . . v 
privirs. .v 
akdsnqelms 
akdsnqelms 
akdsnqelms 
akdsnqelms 
akdsnqelms 
akdsnqelms 
akdsnqelms 
sqg . . sgvne 



fll.dlmvil 
fll.dlmvil 
fll.dlmvil 
ipfvgfngdf 
ipfvgfngdf 
ipfvgfngdf 
ipfvgfngdf 
ipfvgfngdf 
ipfvgfngdf 
ipfvgfngdf 
ysfcri .w.f 



rtykhlkhrf 
rtykhlkhrf 
rtykhlkhrf 
anlqaletpi 
anlqaletpi 
anlqaletpi 
anlqaletpi 
anlqaletpi 
aslqaletpi 
anlqaletpi 
celtst.ntd 



800 

irrf Ikwst 
irrf Ikwst 
irrf Ikwst 
yktiskgsfy 
yktlskgsfy 
yktiskgsfy 
yktlskgsfy 
yktiskgsfy 
yktlskgsfy 
yktlskgsfy 
l.daf .r.fl 



790 



WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



ms a2 0 93 68 . 2 { 14 7_COHl } 
msa209368 . 2 { 147_M73 2 } 
msa209368.2{l47_M7Bl} 
msa209368.2{l47_18RS2l} 
msa209368 . 2 { 147_2603} 
msa209368.2{l47_JM9130013} 
msa209368 .2{l47_090} 
msa2 09368.2(14 7_C JB11 0 } 
msa209368 . 2 { 147_1169NT) 
msa2 09368 . 2 { 147_H36B} 
msa209368.2{l47_A9O9} 
Consensus 



801 

inqmiqlikt 
inqmiqlikt 
inqmiqlikt 
ykpndtthkd 
ykpndtthkd 
ykpndtthkd 
ykpndtthkd 
ykpndtthkd 
ykpndtthkd 
ykpndtthkd 
1 . tk.yns .r 



nwstmnqlll 
nwstmnqlll 
nwstmnqlll 
qleynesapf 
qleynesapf 
qleynesapf 
qleynesapf 
qleynesapf 
qleynesapf 
qleynesapf 
pigvq. issf 



kattilpc.h 
kattilpc . h 
kattilpc.h 
esnnytallt 
esnnytallt 
esnnytallt 
esnnytallt 
esnnytallt 
esnnytallt 
esnnytallt 
.kqqlyclvn 



nqrlgamlim 
nqrlgamlim 
nqrlgamlim 
qsaswgyvdy 
qsaswgyvdy 
qsaswgyvdy 
qsaswgyvdy 
qsaswgyvdy 
qsaswgyvdy 
qsaswgyvdy 
tisvlglc .1 



850 

skmvgs .n.h 
skmvgs.n.h 
skmvgs .n.h 
vknggelela 
vknggelela 
vknggelela 
vknggelela 
vknggelela 
vknggelela 
vknggelela 
cqkwwgvris 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2{ 

msa209368 . 
msa209368.2{l47 
msa209368 
msa209368 .2{ 
msa209368 .2{ 

msa209368 . 

msa2 09368 . 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{147_2603} 
_JM9130013} 
2{147_090} 
147_CJB110} 
147_1169NT} 
2{147_H36B} 
2{147_A909} 
Consensus 



851 

rrvqKelf .e 
rrvqKelf .e 
rrvqKelf .e 
pespKriilg 
pespKriilg 
pespKriilg 
pespKriilg 
pespKriilg 
pespKriilg 
pespKriilg 
tgesKknyf r 



llrirlrikq 
llrirlrikq 
llrirlrikq 
tf enkvedkt 
tf enkvedkt 
tf enkvedkt 
tf enkvedkt 
tf enkvedkt 
tf enkvedkt 
tf enkvedkt 
nf .e.g.g.n 



f ifwkemqri 
f ifwkemqri 
f ifwkemqri 
ihllerdaan 
ihllerdaan 
ihllerdaan 
ihllerdaan 
ihllerdaan 
ihllerdaan 
ihllerdaan 
nssfgkrcse 



ihilpflqik 
ihilpflqik 
ihilpflqik 
npyf aispnk 
npyf aispnk 
npyf aispnk 
npyf aispnk 
npyf aispnk 
npyf aispnk 
npyf aispnk 
.sifchfsk. 



900 

meigtkslpr 
meigtkslpr 
meigtkslpr 
dgnrdeitpq 
dgnrdeitpq 
dgnrdeitpq 
dgnrdeitpq 
dgnrdeitpq 
dgnrdeitpq 
dgnrdeitpq 
rwk . g . nhsp 



msa209368.2{l47 t _COHl} 
msa209368 .2{147_M732} 
msa209368 .2{l47_M78l} 
msa20936B . 2 { 147_18RS21 } 
msa209368 . 2 { 147_2603 } 
msa209368 . 2 { 147_JM9130013 } 
msa209368 .2{l47_090} 
msa2 09368 . 2 { 147_CJB110 } 
msa209368 . 2 { 147_1169NT} 
msa2 09368 . 2 { 147_H36B ) 
msa209368 .2{147_A909} 
Consensus 



901 

qls .emlrif 
qls . emlrif 
qls .emlrif 
atf lrnvkdi 
atf lrnvkdi 
atf lrnvkdi 
atf lrnvkdi 
atf lrnvkdi 
atf lrnvkdi 
atf lrnvkdi 
gnf lkkc .gy 



llkf . ikmem 
llkf .ikmem 
llkf . ikmem 
saqvldqngn 
saqvldqngn 
saqvldqngn 
saqvldqngn 
saqvldqngn 
saqvldqngn 
saqvldqngn 
f csssrskwk 



lfgkvrfyhl 
lfgkvrfyhl 
lfgkvrfyhl 
viwqskvlps 
viwqskvlps 
viwqskvlps 
viwqskvlps 
viwqskvlps 
viwqskvlps 
viwqskvlps 
cylak.gf ti 



ivkisiiiqs 
ivkisiiiqs 
ivkisiiiqs 
yrknf hnnpk 
yrknfhnnpk 
yrknf hnnpk 
yrknfhnnpk 
yrknfhnnpk 
yrknfhnnpk 
yrknfhnnpk 
Is .kfp . . sk 



950 

kvmviivwml 
kvmviivwml 
kvmviivwml 
qsdghyrmda 
qsdghyrmda 
qsdghyrmda 
qsdghyrmda 
qsdghyrmda 
qsdghyrmda 
qsdghyrmda 
ak. wslsygc 



msa209368 
msa209368 
msa209368 . 
msa209368 .2 { 
msa209368 
msa209368 .2(147 
msa209368 
msa209368 .2 
msa209368 .2 
msa209368 
msa209368 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{147_2603} 
JM9130013) 
*2{147_090} 
147__CJB110} 
147_1169NT} 
2{147_H36B} 
2{147_A909} 
Consensus 



951 

f sgw.irma 
f sgw. irma 
f sgw. irma 
lqwsgldkdg 
lqwsgldkdg 
lqwsgldkdg 
f qwsgldkdg 
fqwsgldkdg 
lqwsgldkdg 
lqwsgldkdg 
psvewf r .gw 



kl .qmvFili 
kl .qmvFili 
kl .qmvFili 
kwadgFyty 
kwadgFyty 
kwadgFyty 
kwadgFyty 
kwadgFyty 
kwadgFyty 
kwadgFyty 
qscsrwFlyl 



ayvthq.qke 
ayvthq . qke 
ayvthq.qke 
rlrytpvaeg 
rlrytpvaeg 
rlrytpvaeg 
rlrytpvaeg 
rlrytpvaeg 
rlrytpvaeg 
rlrytpvaeg 
sf tlhtssrr 



qivrsqtlkf 
qivrsqtlkf 
qivrsqtlkf 
ansqesdf kv 
ansqesdfkv 
ansqesdf kv 
ansqesdfkv 
ansqesdf kv 
ansqesdfkv 
ansqesdfkv 
sk.sgvrl . s 



1000 
k.vlshqifl 
k.vlshqif 1 
k.vlshqif 1 
qvstkspnlp 
qvstkspnlp 
qvstkspnlp 
qvstkspnlp 
qvstkspnlp 
qvstkspnlp 
qvstkspnlp 
ssky . vtkss 



msa209368 
msa209368 . 
msa209368 
msa209368.2{ 
msa209368 . 
msa209368.2{l47 
msa209368 
msa209368 .2 
msa209368 .2 
msa209368 
msa209368 . 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{l47_2603} 
_JM9130013 j 
,2{l47_090] 
147_CJB110] 
147_1169NT] 
2{147_H36B) 
2{147_A909) 
Consensus 



1001 

helslmklie 
helslmklie 
helslmklie 
sraqfdetnr 
sraqfdetnr 
sraqfdetnr 
llaqfdetnr 
llaqfdetnr 
sraqfdetnr 
sraqfdetnr 
f tssv. .n.s 



h.a.pclrkv 
h.a.pclrkv 
h. a .pclrkv 
tlslampkes 
tlslampkes 
tlslampkes 
tlslampkes 
tlslampkes 
tlslampkgs 
tlslampkes 
niklsha.gk 



vmf lhivyn. 
vmf lhivyn. 
vmf lhivyn. 
syvptyrlql 
syvptyrlql 
syvptyrlql 
syvptyrlql 
syvptyrlql 
syvpiyrlql 
syvptyrlql 
.lcsyissti 



f ylml . Kmkn 
f ylml . Kmkn 
f ylml . Kmkn 
vlshwKdee 
vlshwKdee 
vlshwKdee 
vlshwKdee 
vlshwKdee 
vlshwKdee 
vlshwKdee 
sfisccKr .r 



1050 
mgmrlltiis 
mgmrlltiis 
mgmrlltiis 
ygdetsyhyf 
ygdetsyhyf 
ygdetsyhyf 
ygdetsyhyf 
ygdetsyhyf 
ygdetsyyyf 
ygdetsyhyf 
iwr.dflplf 



msa209368 . 

msa209368 . 

msa209368 
msa209368.2{ 

msa209368 . 
msa209368 .2{147, 
msa209368 
msa209368 .2{ 
msa209368 .2{ 

msa209368 

msa209368 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS21) 
2{147_2603} 
JM9130013} 
'2{l47_090) 
147_CJB110) 
147_1169NT} 
2{147__H36B} 
2{147_A90 9) 
Consensus 



1051 

i . ikkvk . hf 
i . ikkvk . hf 
i. ikkvk. hf 
hidqegkvtl 
hidqegkvtl 
hidqegkvtl 
hidqegkvtl 
hidqegkvtl 
hidqegkatl 
hidqegkvtl 
pyrsrr . sdt 



lkrlr . ervr 
lkrlr . ervr 
lkrlr . ervr 
pktvkigese 
pktvkigese 
pktvkigese 
pktvkigese 
pktvkigese 
pktvkigese 
pktvkigese 
s .ns.drre. 



lr.tlrp.hl 
lr.tlrp.hl 
lr.tlrp.hl 
vavdpkaltl 
vavdpkaltl 
vavdpkaltl 
vavdpkaltl 
vavdpkaltl 
vavdpkaltl 
vavdpktltl 
gcsrp.dldt 



lwkiklvilq 
lwkiklvilq 
lwkiklvilq 
wedkagnfa 
wedkagnf a 
wedkagnfa 
wedkagnfa 
wedkagnfa 
wedkagnfa 
wedkagnfa 
ccgr.sw.fr 



1100 
r.nclts . ir 
r.nclts . ir 
r.nclts . ir 
tvklsdllnk 
tvklsdllnk 
tvklsdllnk 
tvklsdllnk 
tvklsdllnk 
tvklsdllnk 
tvklsdllnk 
ngkiv.ple. 
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Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msa209368 .2{l47_COHl} 
maa209368 . 2 { 147_M732 } 
msa209368 .2{l47_M78l} 
msa209368.2{l47_18RS2l} 
msa209368 . 2 { 147_2603 } 
maa209368 . 2 { 147_JM9130013 } 
msa209368.2{l47_090} 
msa209368 . 2 { 147_CJB110 } 
msa209368 . 2 { 147_1169NT] 
msa209368 .2{147_H36B} 
msa209368 .2{l47_A909} 
Consensus 



1101 

q , yqrkktl . 
q. yqrkktl . 
q . yqrkktl . 
awsekenai 
awsekenai 
awsekenai 
awsekenai 
awsekenai 
awsekenai 
awsekenai 
gssirerkry 



. f Itvsnili 
♦ f Itvsnili 
•f Itvsnili 
visnsfkyfd 
visnsf kyfd 
visnsfkyfd 
visnsfkyfd 
visnsfkyfd 
visnsfkyfd 
visnnfkyfd 
snf .qfqif . 



t.rKnlclfl 
t.rKnlclfl 
t.rKnlclfl 
nlkKepmfis 
nlkKepmf is 
nlkKepmfis 
nlkKesmf is 
nlkKesmfis 
nlkKepmfis 
nlkKepmfis 
. leKrtyvyf 



kkeK. .tri. 
kkeK. .tri. 
kkeK. .tri. 
kkeKwnknl 
kkeKwnknl 
kkeKwnknl 
kegKwnknl 
kegKwnknl 
kkeKwnknl 
kegKwnknl 
.rrKsskqes 



1150 
kk.h.lslkl 
kk.h.lslkl 
kk.h.lslkl 
eeiilvkpqt 
eeiilvkpqt 
eeiilvkpqt 
eeitlvkpqt 
eeitlvkpqt 
eeiilvkpht 
eeialvkpqt 
rrnsis.aan 



msa209368 . 

msa209368 . 

msa209368 . 
msa209368 .2 { 

msa209368 
msa209368.2{l47 
msa209368 
msa209368 .2 { 
msa209368 .2 { 

msa209368 . 

msa209368 . 



2{l47_COHl} 
2{147_M732} 
2{147_M781} 
147_18RS2l} 
2{l47_2603} 
_JM9130013} 
2{147_090} 
147_CJB110} 
147_1169NT} 
2{l47_H36B} 
2{147_A909} 
Consensus 



1151 

qlllnhclkk 
qlllnhclkk 
qlllnhclkk 
tvttqslske 
tvttqslske 
tvttqslske 
tvttqslske 
tvttqslske 
tvttqslske 
tvttqslske 
ysyysiiv.r 



.Inqemrkss 
.lnqemrkss 
. lnqemrkss 
itksgnekvl 
itksgnekvl 
itksgnekvl 
itksgnekvl 
itksgnekvl 
itksgnekvl 
itqsgnekvl 
nnsirk.esp 



llqtiivae. 
llqtiivae. 
llqtiivae. 
tstnnnssrv 
tstnnnssrv 
tstnnnssrv 
tstnnnssrv 
tstnnnssrv 
tstnnnssrv 
tstnnnssrv 
hfykq. . .qs 



lrsyhlnitg 
lrsyhlni tg 
lrsyhlnitg 
akiispkhng 
akiispkhng 
akiispkhng 
akiispkhng 
akiispkhng 
aki i spkhng 
akiispkhng 
s .dhit .t .r 



1200 

illti 

illti 

illti 

dsvnhT 

dsvnhTlpst 

dsvnhT 

dsvnhT 

dsvnhT 

dsvnhT 

dsvnhT 

gfc.py 



msa2 09368 . 2 { 147_COHl } 
msa209368 . 2 { 147_M732 } 
msa209368 . 2 { 147_M781 } 
msa209368 . 2 { 147_18RS21 } 
j msa2 09368 .2{l47_2603} 
msa209368.2{l47_JM9130013} 
msa209368 . 2 { 147_090 } 
msa209368 . 2 { 147__CJB110 } 
msa209368.2{l47_1169NT} 
msa209368 . 2 { 147_H36B} 
msa209368 .2{147_A909} 
Consensus 



sdratnglfv gtlallssll lylkpkktkn nsk 
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Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD) 



SEQ ID NO. 4501 
STRAIN 2603 

ATGAAAAAGATTAGAAAAAGTTTAGGACTTCrACTATGTTGCTTTTTAGGATTGGTAC^ 

TTAGCGTTTTTTTCGGTAGCCAGTGTAAATGCTGATACCCCrrAATCAACTA^CAATCACA 

CAGATAGGACTTCAGCCAAATACTACAGAGGAGGGGATTTCTTATCGTTTA 

ACTGACAACTTAAAAGTTG ATTTATTGAG C CAAATGACAG AT AG CGAATT GAACCAG AAG 

TATAAG AGTATCTTGACTT CTC CT ACTGAT ACTAATGGT CAG ACAAAGAT AG CACTCCCA 

AATGGTTCGTAC^TGGTCGTGCTTATAAAGCTGATCAAAGCGTTTCAACAATAGTACCT 

TTTCATATTGAATTACCAGATGATAAGTTATCAAATCAATT^ 

AAAGTTGAAACAGG C CGATT AAAACTTATTAAATATACAAAAGAAGG AAAGATAAAG AAA 

AGGCTATCCGGAGTAATATTTGTATTATACGATAACCAGAATCAGCCAGTTCGCrTTAAA 

AATGGACGATTTACGACCGATCAAGATGGGATTACTTCATTAGTAACTGATGATAAGGGA 

GAAATTGAGGTTGAAGGTTTATTACCT GGTAAGTATATTTTT CG AGAAG CAAAAG CACTA 

ACrGGTTACCGTATATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAG 

GAAGTAGAGGTAGAAAACGAAAAAGAA^CTCCTCCACCAACAAATCCTAAACCATCACAA 

CCG CTTTTT C CACAATCATTTCTT CCT AAAACAGG AATG ATT ATTGGTGGAGGACTGACA 

ATTCTTGGTTGTATTATTTTGGGAATTTTGTTTATCTTTTTAAGAAAAACTA 

AAAT CTG AAAGAAACG ATACAGT A 

SEQ ID NO. 4502 
STRAIN 090 

GATACCCCTAATCAACTAACAATCACAC 

AGATAGGACTT CAG C CAAAT ACTACAGAGG AGGGG ATTT CTT AT CGTTT A 
TGGACTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGA 
TAGCGAATTGAACCAGAAGTATAAGAGTATCTTGACTTCTCCTACTGATA 
CTAATGGtCAGACAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGT 
GCTT ATAAAG CTGAT CAAAGCGTTT CAACAAT AGTACCTTTTTATATTG A 
ATTACCAGATGATAAGTTAT CAAATCAATTACAGATAAATCCTAAGCGAA 
AAGTTGAAACAGGCCGATTAAAACTTATTAAATATACAAAAGAAGGAAAG 
ATAAAGAAAAGGCTATCAGGAGTAATATTTGTATTATACGATAACCAGAA 
TCAGCCAGTTCG CTTTAAAAATGGACGATTTACGACCGAT CAAGATGGGA 
TTACTTCATTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTA 
TTAC CTGGTAAGTATATTTTT CGAGAAG CAAAAG CACTAACTGG t TACCG 
TATATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGG 
AAGTaGAGGTaGAAAACGAAAAAGAAACTCCTCCACCAAGAAATCCTAAA 
CCATCACAACCG 

SEQ ID NO. 4503 
STRAIN H36B 

GATACCCCTAATCAACTAACAATCACACAGA 

TAGGACTTCAG CCAAATACT AGAGAGGAGGGGATTT CTTATCGTTTATGG 
ACTGTGACTG ACAACTTAAAAGTTGATTTATTGAG C CAAATGACAGATAG 
CGAATTG AACCAGAAGTATAAGAGTAT CTTGACTT CTCCTACTGATACTA 
ATGG t CAGAC^AAGATAGCACTCCGAAATGGTTCGTACTTTGGTCGTGCT 
TATAAAG CTGATCAAAG CGTTT CAACAATAGTAC CTTTTTATATTGAATT 
ACCAGATGATAAGTTATCAAAT CAATTACAGATAAATC CTAAGCGAAAAG 
TTGAAACAGG CCGATTAAAACTTATTAAATATACAAAAGAAGGAAAGATA 
AAGAAAAGG CTwTCCGGAGTAATATTTGT ATTAT ACG AT AACCAGAAT CA 
GCCAGTTCX5CTTTAAAAATGGA.CGATTTACGACCGAT CAAGATGGGATTA 
CTTC^TTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTATTA 
CCTGGTAAGTATATTTTTCGAGAAGCAAAAGCACr 

AT CTATGAAGGATG CTGTAGTTGCTGTAGTTG CTAATAAAACACAGGAAG 
TAGAGGTAGAAAACGAAAAAGAAACTCCTCCA,CCAACAAATCCTA\ACCA 
TCACAACCGC 

SEQ ID NO. 4504 
STRAIN 18RS21 

GATACC CCTAAT CAACTAACAATCACACAG 

AT AGGACTT CAG CCAAATACTACAGAGGAGGGGATTT CTTATCGTTT ATG 

GACTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGATA 

GCX3AATTGAACCAGAAGTATAAGAGTATCTTGACTTCT 

AATGG t CAGACAAAGATAG CACTCC CAAATGGTTCGTACTTTGGTCGTG C 

TTATAAAGCTGATCAAAGCGTTTCAACAATAGTACCTTI^TATATTGAAT 

TACCAGATGATAAGTTATCAAATCAATTACAGATAAATCCTAAGCGAAAA 

GTTGAAACAGGCCGATTAAAACTTATTAAATATACAA^GAAGGAAAGAT 

AAAGAAAAGGCTATCCGGAGTAATATTTGTATTATACGATAACCAGAATC 

AG CCAGTTCG CTTTAAAAATGGACG ATTTACGACCGATCAAGATGGGATT 

ACTTCATTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTATT 

ACCTGGTAAGTATATTTITCGAGAAGCAAAAG CACTAACTGGTTACCGTA 

TATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAAOVCAGGAA 

GTAGAGGTAGAAAACa^^AAAGAAACTCCTCC^CCAACAAATCCTAAACC 

ATCACAACC 

SEQ ID NO. 4505 
STRAIN CJB110 

GATACCCCTAATCAACTAACAATCACACA 

GATAGGACTTCAGCCAAATACTAa^GAGGAGGGGATTTCTTATCGTT^ 

GGaCTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCPAA 

AGCGAATTgAACCAGAAGTATAAGAGTATCTTGACTTCTCctACTGATAc 

T AATGGTCAG ACAAAGATAG CACT CCC AAATGGTT CGTACTTTGGT CGT G 

CTTAT AAAG CTGATCAAAG CGTTT CAACAATAGTACCTTTTTATATTGAA 

TTACCAGATGATAAGTTATCAAATCAATTAC^GatA 

AGTTGAAACAGG CCG ATTa aAACTTATTAAATATACAAAAGAAGGAAAGA 
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Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD) 

TAAAGAAAAGGCTaTCAGGAGTAATATTTGTATTATACGATAACCAGAAT 
CAGCCAGTTCGCTTTAAAAATGGACGATTTACGACCGATCAAGATGGGAT 
TACIT CATTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTAT 
TACCTGGT AAGTATATTTTT CGAG AAG CAAAAG CACTAACTGGTTa CCGT 
AT AT ATG AAGGATG CTGT AGTTG CTGT AGTTG CTAAT AAAACACA0X3 A 
AGTAGAGGTAGAAAACGAAAAAGAAACTCCTCCACCAACAAATCCTAAAC 
CATCACAACC 

SEQ ID NO. 4506 
STRAIN 1169NT 

GATAC CC CTAAT CAACTAACAAT CACACAG 

ATAGGACTTCAG C CAAATACT ACAG AGGAGGG G ATTT CTT ATCGTTT ATG 

GACTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGATA 

GCGAATTGAACCAGAAGTATAAGAGTATCTTGACTTCTCCTACTGATACT 

AATGGtCAgaCAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGTGC 

TTATAAAGCTGATGAAAGCGTTTCAACAATAGTACCTT^ 

TAC CAGATGATAAGTTATGAAATCAATTACAGAT AAATCCTAAG CGAAAA. 

GTTGAAACAGGCCGATTAAAACrTATTAAATATACAAAAGAAGGAAAGAT 

AAAGAAAAGGCTATCAGGAGTAATATTTGTATTATACGATAACCAGAATC 

AGCCAGTTCG CTITTAAAAATGG ACGATTTACGAC CGATCAAG ATGGGAT T 

ACTTCATTAGTAACtgaTGATAAGGGAGAAATTGAGGTTGA^GGTTTATT 

AC CTGGTAAGTATATTTTT CGAGAAGCAAAAG CACTAACTGGTT ACCGTA 

TATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGGAA 

GTAGAGGTAGAAAACGAAAAAGAAACTCCTCCACCAACAAATCCTAAACC 

ATCACAACC 

PRETTY Of : /biotmp/msal84750 . 2 { * } May 13, 2003 06:23 

1 50 

msal84750.2{l50_090} 

msa!84750 . 2 { 150_1169NT} 

msalB47 50 . 2 { 150_CJB110 } 

msal84750.2{l50_18RS2l} 

msal84750.2{l50_2603) atgaaaaaga ttagaaaaag tttaggactt ctactatgtt gctttttagg 

msal84750.2{l50__H36B} 

Consensus ********** ********** ********** ********** ********** 

51 100 

msal847 50 . 2 { 150 JD 90 } GATACCC 

msal84750.2{l50_1169NT} GATACCC 

rasal847 50, 2( 150_CJB110> GATACCC 

msal84750.2{150_18RS2l} GATACCC 

msal84750-2{l50_2603} attggtacaa fctagcgtttt tttcggtagc cagtgtaaat gctGATACCC 

msal84750.2{l50_H36B} GATACCC 

Consensus ********** ********** ********** ********** ********** 



msal84750 .2{150_090} 
rasal84750 . 2 { 150_1169NT \ 
msal84750.2{l50_CJB110} 
msal84750 . 2 { 150_18RS21 } 
msal84750.2{l50_2603> 
msal84750 . 2 { 150_H3 6B} 
Consensus 



101 

CTAATCAACT 
CTAAT CAACT 
CTAATCAACT 
CTAATCAACT 
CTAATCAACT 
CTAATCAACT 
********** 



AACAATCACA 
AACAATCACA 
AACAATCACA 
AACAATCACA 
AACAATCACA 
AACAATCACA 
********** 



CAGATAGGAC 
CAGATAGGAC 
CAGATAGGAC 
CAGATAGGAC 
CAGATAGGAC 
CAGATAGGAC 
********** 



TTCAGCCAAA 
TTCAGCCAAA 
TTCAGCCAAA 
TTCAGCCAAA 
TTCAGCCAAA 
TTCAGCCAAA 
********** 



150 

TACTACAGAG 
TACTACAGAG 
TACTACAGAG 
TACTACAGAG 
TACTACAGAG 
TACTACAGAG 
********** 



151 200 

msal84750 ,2{150_090} GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

msal 84750 -2{l50_1169NT} GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

msal84750 .2{l50_CJB110} GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

msal84750.2{l50_18RS2lJ GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

msal84750.2{l50_2S03) GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

msal84750.2{l50_H36B} GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA 

Consensus ********** ********** ********** ********** ********** 



msal84750 ,2{l50_090> 
msal84750 . 2 { 150_1169NT| 
msal84750.2(l50_CJB110 
msal84750 - 2 { 150_18RS21 ] 
msal84750 . 2 { 150__2603 ] 
msal84750 . 2{150_H36B] 
Consensus 



201 

TTTATTGAGC 
TTTATTG AG C 
TTTATTGAGC 
TTTATTGAGC 
TTTATTGAGC 
TTTATTGAGC 
********** 



CAAATGACAG 
CAAATGACAG 
CAAATGACAG 
CAAATGACAG 
CAAATGACAG 
CAAATGACAG 
********** 



ATAGCGAATT 
ATAGCGAATT 
ATAGCGAATT 
ATAGCGAATT 
ATAGCGAATT 
ATAGCGAATT 
********** 



GAACCAGAAG 
GAACCAGAAG 
GAACCAGAAG 
GAACCAGAAG 
GAACCAGAAG 
GAACCAGAAG 
********** 



250 

TATAAGAGTA 
TATAAGAGTA 
TATAAGAGTA 
TATAAGAGTA 
TATAAGAGTA 
TATAAGAGTA 
********** 



251 

tnsa!84750 .2{150_090} TCTTGACTTC 

msal84750 .2(150 1169NT} TCTTGACTTC 

msal84750.2{l50~CJB110} TCTTGACTTC 

msal84750 .2 { 150_18RS2l} TCTTGACTTC 

msal84750.2{l50_2603j TCTTGACTTC 

msal84750.2{l50_H3SB) TCTTGACTTC 

Consensus ********** 



TCCTACTGAT 
TCCTACTGAT 
TCCTACTGAT 
TCCTACTGAT 
TCCTACTGAT 
TCCTACTGAT 
********** 



ACTAATGGTC 
ACTAATGGTC 
ACTAATGGTC 
ACTAATGGTC 
ACTAATGGTC 
ACTAATGGTC 



AGACAAAGAT 
AGACAAAGAT 
AGACAAAGAT 
AGACAAAGAT 
AGACAAAGAT 
AGACAAAGAT 



********** ********** 



300 

AGCACTCCCA 
AGCACTCCCA 
AGCACTCCCA 
AGCACTCCCA 
AGCACTCCCA 
AGCACTCCCA 
********** 



301 



350 



794 



WO 2004/018646 



PCT/US2003/026827 



Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD) 



msa!84750 .2{l50_090} 
tnsal84750 . 2 { 150_1169NT} 
msal84750 . 2 { 150_CJB110 } 
msal84750.2{l50_18RS2l} 
msal84750 . 2 (l5CT_2603 } 
msal84750.2{150_H36B} 
Consensus 



AATGGTTCGT 
AATGGTTCGT 
AATGGTTCGT 
AATGGTTCGT 
AATGGTTCGT 
AATGGTTCGT 
********** 



ACTTTGGTCG 
ACTTTGGTCG 
ACTTTGGTCG 
ACTTTGGTCG 
ACTTTGGTCG 
ACTTTGGTCG 
********** 



TGCTTATAAA 
TGCTTATAAA 
TGCTTATAAA 
TGCTTATAAA 
TGCTTATAAA 
TGCTTATAAA 
********** 



GCTGATCAAA 
GCTGATCAAA 
GCTGATCAAA 
GCTGATCAAA 
GCTGATCAAA 
GCTGATCAAA 
********** 



GCGTTTCAAC 
GCGTTTCAAC 
GCGTTTCAAC 
GCGTTTCAAC 
GCGTTTCAAC 
GCGTTTCAAC 
********** 



351 

meal 84 7 50 .2{l50_090} AATAGTACCT TTTTATATTG AATTACCAGA 

msal84750 . 2 { 150_1169NT} AATAGTACCT TTTTATATTG AATTACCAGA 

msal84750.2{l50_CJB110} AATAGTACCT TTTTATATTG AATTACCAGA 

msal84750 . 2 { 150_18R52l} AATAGTACCT TTTTATATTG AATTACCAGA 

msal84750.2{l50_2603} AATAGTACCT TTTTATATTG AATTACCAGA 

rasal84750.2{l50_H36B} AATAGTACCT TTTTATATTG AATTACCAGA 

Consensus ********** ********** ********** 



TGATAAGTTA 
TGATAAGTTA 
TGATAAGTTA 
TGATAAGTTA 
TGATAAGTTA 
TGATAAGTTA 



400 

TCAAATCAAT 
TCAAATCAAT 
TCAAATCAAT 
TCAAATCAAT 
TCAAATCAAT 
TCAAATCAAT 



********** ********** 



401 450 

msal84 750. 2 { 150_090} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

msal84750.2ll50_1169NT} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

msal 8 4 7 5 0 . 2 { 1 5 0_C JB1 10} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

msal84750.2{l50_18RS2l} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

msal84750 . 2{150__2603} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

msal84750.2{l50_H36B} TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT 

Consensus ********** ********** ********** ********** ********** 



msal84750 .2{150_090} 
msal84750 . 2 { 150_1169NT} 
msal84750 . 2 { 150_CJB110 } 
msal84750 .2{150_18RS21} 
msal84750 . 2 f 150_2'603 ) 
msal84750 . 2 { 150_H36B) 
Consensus 



451 

AAATATACAA 
AAATATACAA 
AAATATACAA 
AAATATACAA 
AAATATACAA 
AAATATACAA 
********** 



AAGAAGGAAA 
AAGAAGGAAA 
AAGAAGGAAA 
AAGAAGGAAA 
AAGAAGGAAA 
AAGAAGGAAA 
********** 



GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
********** 



AGGCTaTCaG 
AGGCTaTCaG 
AGGCTaTCaG 
AGGCTaTCcG 
AGGCTaTCcG 
AGGCTwTCcG 
*****„**_* 



500 

GAGTAATATT 
GAGTAATATT 
GAGTAATATT 
GAGTAATATT 
GAGTAATATT 
GAGTAATATT 
********** 



501 

msal84750.2{l50_090} TGTATTATAC GATAACCAGA 

msal84750.2{l50_1169NT} TGTATTATAC GATAACCAGA 

msal84750.2|l50_CJB110} TGTATTATAC GATAACCAGA 

msal84750.2{150_18RS2l} TGTATTATAC GATAACCAGA 

msal8 4750. 2 ( 150^2603} TGTATTATAC GATAACCAGA 

msal84750.2{150_H36B} TGTATTATAC GATAACCAGA 

Consensus ********** ********** 



ATCAGCCAGT 
ATCAGCCAGT 
ATCAGCCAGT 
ATCAGCCAGT 
ATCAGCCAGT 
ATCAGCCAGT 



TCGCTTTAAA 
TCGCTTTAAA 
TCGCTTTAAA 
TCGCTTTAAA 
TCGCTTTAAA 
TCGCTTTAAA 



********** ********** 



550 

AATGGACGAT 
AATGGACGAT 
AATGGACGAT 
AATGGACGAT 
AATGGACGAT 
AATGGACGAT 
********** 



551 600 

msal847 50. 2 { 150_090> TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

msal84750 . 2 { 150_1169NT> TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

msal84750 .2 { 150_CJB110) TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

msal84750.2{l50_18RS2l} TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

msal847 5 0.2 {l50_2603} TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

ms al 847 50 . 2 { 1 5 0_H3 6B } TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA 

Consensus ********** ********** ********** ********** ********** 

601 650 

msal84750 .2{l50_090) GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

msa!84750 .2{150_1169NT) GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

msal84750 .2{150_CJB110} GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

msal84750.2{l50_18RS2l} GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

msal84750.2(l50_2603} GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

msa!84750.2{l50_H36B} GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC 

Consensus ********** ********** ********** ********** ********** 



651 

msal84750 .2{150_090} AAAAGCACTA 

msal84750.2{l50_ll69NT} AAAAGCACTA 

msal84750 .2{150_CJB110} AAAAGCACTA 

msal84750 .2{150_18RS21} AAAAGCACTA 

msal84750.2{l50_2603} AAAAGCACTA 

msal84750.2{l50_H36B} AAAAGCACTA 

Consensus ********** 



ACTGGTTACC 
ACTGGTTACC 
ACTGGTTACC 
ACTGGTTACC 
ACTGGTTACC 
ACTGGTTACC 
********** 



GTATATCTAT 
GTATATCTAT 
GTATATCTAT 
GTATATCTAT 
GTATATCTAT 
GTATATCTAT 
********** 



GAAGGATGCT 
GAAGGATGCT 
GAAGGATGCT 
GAAGGATGCT 
GAAGGATGCT 
GAAGGATGCT 
********** 



700 

GTAGTTGCTG 
GTAGTTGCTG 
GTAGTTGCTG 
GTAGTTGCTG 
GTAGTTGCTG 
GTAGTTGCTG 
********** 



701 

msal84750.2{l50_090} TAGTTG CTAA 

msal84750.2{l50_1169NT} TAGTTGCTAA 

msal84750.2{150_CJB110} TAGTTGCTAA 

msal84750.2{ 15 0_1 8RS2 1 } TAGTTGCTAA 

msal84750.2{l50_2603} TAGTTGCTAA 

msal84750.2{ 1 50_H3 6B } TAGTTGCTAA 

Consensus ********** 



TAAAACACAG 
TAAAACACAG 
TAAAACACAG 
TAAAACACAG 
TAAAACACAG 
TAAAACACAG 
********** 



GAAGTAGAGG 
GAAGTAGAGG 
GAAGTAGAGG 
GAAGTAGAGG 
GAAGTAGAGG 
GAAGTAGAGG 
********** 



TAGAAAACGA 
TAGAAAACGA 
TAGAAAACGA 
TAGAAAACGA 
TAGAAAACGA 
TAGAAAACGA 
********** 



750 

AAAAGAAACT 
AAAAGAAACT 
AAAAGAAACT 
AAAAGAAACT 
AAAAGAAACT 
AAAAGAAACT 
********** 



msal84750.2{150_090} 
msal84750 . 2 { 150_1169NT} 



751 

CCTCCACCAA CAAATCCTAA ACCATCACAA CCg- 
CCTCCACCAA CAAATCCTAA ACCATCACAA CC~~ 



800 



795 



WO 2004/018646 



PCT/US2003/026827 



Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD) 



msal84750-2(l50_CJB110) 
msal84750.2{150_18RS21) 
msal847S0 . 2 { 150_2603 } 
msal84750 . 2 { 150_H36B} 
Consensus 



msal847S0.2{l50_090} 
msal84750 . 2 {150 JL169NT} 
msal84750.2{l50_CJB110) 
msal84750 . 2 { 150_18RS21 } 
rasal84750 . 2 { 150_2603 } 
msal84750 .2{150_H36B} 
Consensus 



msal84750 . 2 { 150_090 } 
msal84750.2{l50_1169NT} 
msal84750 . 2 { 150_CJB110 } 
msal84750 . 2 { 150_18RS21 J 
rasal84750 . 2 { 150_2603 } 
msal84750 . 2 { 150_H36B} 
Consensus 



msal84750 . 2 { 150_090 } 
msal84750 .2 f 150_1169NT} 
rasal84750 . 2 { 150_CJB110 } 
msal84750.2(150_18RS2l} 
msal84750 . 2 { 150_2603 } 
msal84750.2{l50_H36B} 
Consensus 



CCTCCACCAA CAAATCCTAA AC CAT CACAA CC 

CCTCCACCAA CAAATCCTAA AC CAT CACAA CC 

CCTCCACCAA CAAATCCTAA ACCATCACAA CCgCtttttc cacaatcatt 

CCTCCACCAA CAAATCCTAA ACCATCACAA CCgC 

********** ********** ********** **_******* ********** 



801 



850 



tcttcctaaa acaggaatga ttattggtgg aggactgaca attcttggtt 
********** ********** ********** ********** ********** 

851 900 



gtattatttt gggaattttg tttatctttt taagaaaaac taaaaatagc 
********** ********** ********** ********** ********** 

901 924 



aaatctgaaa gaaacgatac agta 



********** ********** **** 



SEQ ZD NO. 4507 
STRAIN 2603 

MKKI RKSLGLLLCCFLGLVQLAFFSVASVNADTPNQLTI TQIGLQPNTTEEGI SYRLWTV 
TDNLKVDLLSQMTDSELNQKYKS I LTS PTDTNGQTKI ALPNGS YFGRAYKADQSVST I VP 
FYI ELPDDKLSNQLQ I NPKRKVETGRLKL I KYTKEGKI KKRLSGVI FVLYDNQNQPVRFK 
NGRFTTDQDG ITSLVTDDKGE I EVEGLLPGKY I FREAKALTG YR I SM KD AWAWANKTQ 
EVEVENEKETPPPTNPKPSQPLFPQSFLPKTGMI IGGGLTILGCI ILGILFI FLRKTKNS 
KSERNDTV 



SEQ ID NO. 4508 
STRAIN 090 

DTPNQLT I TQ I GLQPNTTEEG I S YRLWTVTDNLKVDLLSQMTDSELNQKYKS ILTS PTDT 
NGQTKI ALPNGS YFGRAYKADQS VSTI VPF Y I ELPDDKLSNQLQ I NPKRKVETGRLKL I K 
YTKEGKI KKRLSGVI FVLYDNQNQPVRFKNGRFTTDQDG ITSLVTDDKGE I EVEGLLPGK 
YI FREAKALTG YR I SMKDAVVAWANKTQEVEVENEKETPPPTNPKPSQP 

SEQ XD NO. 4509 
STRAIN H3 6B 

DTPNQLTITQ I GLQPNTTEEG I S YRLWTVTDNLKVDLLSQMTDSELNQKYKS I LTS PTDT 
NGQTKI ALPNGS YFGRAYKADQSVSTI VPF YI ELPDDKLSNQLQ I NPKRKVETGRLKLI K 
YTKEGKI KKRLSGVI FVLYDNQNQPVRFKNGRFTTDQDGI TSLVTDDKGE I EVEGLLPGK 
YI FREAKALTGYRI SMKDAWAWANKTQEVEVENEKETPPPTNPKPSQP 

SEQ XD NO. 4510 
STRAIN 18RS21 

DTPNQLTITQIGLQPNTTEEGISYRLWTVTDNLKVDLLSQ^^T^SEI^QKYKSILTSPTDT 
NGQTKI ALPNGS YFGRAYKADQS VST I VPF Y I ELPDDKLSNQLQ I NPKRKVETGRLKLI K 
YTKEGKI KKRLSGVI FVLYDNQNQPVRFKNGRFTTDQDG ITSLVTDDKGE I EVEGLLPGK 
YI FREAKALTGYRI SMKDAWAWANKTQEVEVENEKETPPPTNPKPSQ 

SEQ XD NO. 4511 
STRAIN 1169NT 

DTPNQLTITQIGLQPNTTEEGI S YRLWTVTDNLKVDLLSQMTDSELNQKYKS ILTSPTDT 
NGQTKIALPNGSYFGRAYKATXJSVSTIVPFYIELPDDKLSNQIX3 1 NPKRKVETGRLKLI K 
YTKEGKI KKRLSGVI FVLYDNQNQPVRFKNGRFTTDQDG I TSLVTDDKGE I EVEGLLPGK 
YI FREAKALTGYRI SM KDA WA WANKTQEVE VENE KET P P PTN PKP S Q 



PRETTY of: /biotmp/msal84868 . 2 { * } May 13, 2003 06:25 .. 

1 50 

msal84868.2{l50_090} DTPNQLTIT QIGLQPNTTE 

msal84868.2f 150_2603} mkkirkslgl llccflglvq laffsvasvn aDTPNQLTIT QIGLQPNTTE 

msal84868 .2{150_H36B} DTPNQLTIT QIGLQPNTTE 

msal84868 .2f 150_1169NT) DTPNQLTIT QIGLQPNTTE 

msal84868 .2{150_18RS21) DTPNQLTIT QIGLQPNTTE 

Consensus ********** ********** ********** ********** ********** 

51 100 
msal84868. 2 {150^090} EG I SYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP 



796 



WO 2004/018646 



PCT/US2003/026827 



Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD) 



msal84868 . 2 { 150_2603 } 
rasal84868 . 2 { 150__H36B} 
msal84868 . 2 { 150_1169NT} 
msal84868 . 2 { 150_18RS21 } 
Consensus 



EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP 
EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP 
EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP 
EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP 
********** ********** ********** ********** ********** 



rasal84868 .2{150_090} 
msal84868 . 2 { 150_2603 J 
msal84868 .2{150_H36B) 
msal848 68.2{l50__1169NT} 
msa!84868 . 2 { 150_18RS21 } 



101 

NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR 
NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR 
NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR 
NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR 
NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR 
Consensus ********** ********** ********** ********** 



150 

KVETGRLKLI 
KVETGRLKLI 
KVETGRLKLI 
KVETGRLKLI 
KVETGRLKLI 
********** 



msal84868 . 2 { 150_090 } 
msal84868 .2{150_2603] 
msal84868.2{l50_H3 6B} 
msal84868 . 2 {l50_1169NT} 
msal84868 .2{150_18RS21} 
Consensus 



151 

KYTKEGKIKK 
KYTKEGKIKK 
KYTKEGKIKK 
KYTKEGKIKK 
KYTKEGKIKK 



RLSGVIFVLY 
RLSGVIFVLY 
RLSGVIFVLY 
RLSGVIFVLY 
RLSGVIFVLY 



DNQNQPVRFK 
DNQNQPVRFK 
DNQNQPVRFK 
DNQNQPVRFK 
DNQNQPVRFK 



NGRFTTDQDG 
NGRFTTDQDG 
NGRFTTDQDG 
NGRFTTDQDG 
NGRFTTDQDG 



********** ********** ********** ********** 



200 

ITSLVTDDKG 
I TSLVTDDKG 
ITSLVTDDKG 
ITSLVTDDKG 
ITSLVTDDKG 
********** 



msal84868 .2{150_090} 
msal84868 .2{l50__2603 } 
msal84868.2{l50_H36B} 
msal848 68 .2{150_1169NT} 
tnsal84868 .2{150_18RS21} 
Consensus 



201 

EIEVEGLLPG 
EIEVEGLLPG 
EIEVEGLLPG 
EIEVEGLLPG 
EIEVEGLLPG 



KYIFREAKAL 
KYIFREAKAL 
KYIFREAKAL 
KYIFREAKAL 
KYIFREAKAL 



TGYRI SMKDA 
TGYRI SMKDA 
TGYRI SMKDA 
TGYRI SMKDA 
TGYRI SMKDA 



WAWANKTQ 
WAWANKTQ 
WAWANKTQ 
WAWANKTQ 
WAWANKTQ 



********** ********** ********** ********** 



250 

EVEVENEKET 
EVEVENEKET 
EVEVENEKET 
EVEVENEKET 
EVEVENEKET 
********** 



msal84868 . 2 { 150_090 } 
msal84868 .2{150_2603} 
msal84868.2{l50_H36B} 
msal84868 . 2 { 150_1169NT} 
msal84868 .2{150_18RS21} 
Consensus 



251 300 

PPPTNPKPSQ p 

PPPTNPKPSQ plfpqsflpk tgmiiggglt ilgciilgil fiflrktkns 

PPPTNPKPSQ p 

PPPTNPKPSQ 

PPPTNPKPSQ 

********** „********* ********** ********** ********** 



301 

msal84868 .2{l50_090} 

msal84868-2{l50_2603} ksemdtv 

msal84868 -2{150_H3SB} 

msal84868 . 2 { 150_1169NT} 

msal84868 . 2{l50_18RS2l} 

Consensus ******** 



797 



WO 2004/018646 PCT/US2003/026827 
Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



SEQ ID NO. 4601 
STRAIN A909 

TGACAAATATTATTTTACCCAACGTGGT^ 

CTCAC CGAATAATAT CAGTGAGGATTTAG AGATTATTG CAGG AAATG CTTTT CGT CCAGA 

TAACAATGAAGAGTTGGCTTATGTTATTGAAAAGGGCTATCATTTTAAACGATATCATGA 

ATTT CTCGG AGATTTT ATG CGT CAGTT CACTAGT CTAGGTGTAG CTG GGG CACATGGAAA 

AACCTCAACGACAGGTTTATTAGCTCATGTTTTAAAA 

AATTGGAGATGGTACAGGACGTGGTTCTGCTAATGCTAATTACriTTGTC 

TGAATACGAACGTCATTTT ATG C CGTAC CATC CAGAATACT^ 

TTTTGACCATCCTGATTATTTTACAG^ 

TGCT AAG CAAGTTCAAAAAGGTTTATT CATTTATGGAGAAGATC CAAAACTTCATGAAAT 
CACT T CTGAGGCAC CAATATATTATTATGGTTTTGAAGATTCAAA.TGATTTT AT AGCAAA 
AGACATCACTCGAACTGTTAA.TGGTTCTGACTTTAAGGTTTTCTA^ 
TGGTCAGTTTCATGTACCAGCATACGGTAAACATAATATCTTAAATGCA^ 
TGCTAACCTTTACATAATGGGAATTGATATGGCATTAGTAGCTGAGCATTTGAAGACATT 
TT CAG GGGT AAAGCGT CGTTTTACTGAG AA.GATT ATTGACGATACTGT CATT ATTGATGA 
CTTTG CT CACCATC CTACTGAGATTATTGCGACATTAGATG CTG CT CGACAAAAATACCC 
GT CAAAAGAAATTGT AG CTATTTT C CAAC CGCATACGTTCACTC GTACGATAGCT CTTTT 
AGACGAATTTG C CCATG CCTTGAGTCAAGCGGAT AG CGTTTATCT CG CT CAAATATATGG 
TTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAAGATTTAGCTGCTAAGATTGT 
CAAACACTCAGATTTAGTGACAGTCGAAAATGTCTCGCCrTTACT 
TGTCTATGTCTTTATGGGTG CTGGAGACATTC^ 
ATTAGCTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4602 
STRAIN 1169NT 

AAAAGCAGGCTCTAGTGACGTTGACAAATATTATTTTACCC^ 

AGGTGTAACTATATTAC CTT'TOTCACCGAATAATATCAGTGAGGATTTAGAGATT ATTG C 
AGGAAATGCTTTTCGTC CAGATAACAATGAAGAGTTGGCTTATGTTATT 
TCATTTTAAACGATATCATGAATTTCTCGGAGATTTT A CACT AGTCT AGG 
TGTAGCTGGGGCACATGGAAAAA 

TATTACAGACACTTCr 1 N i.' CCT AATTGG AGATGGT ACAGGACGTGGTT CTG CTAATG CTAA 
TTACTTTGTGTTTGAAGCTGATGA^ 

CT CAATTATT AC CAATATTG ATTTTGAC CAT CCTG ATTATTTTACAGGC CTAGAGGACGT 

ATTCAATGCCTTTAATGACTATGCTAAGCAAGTTCAAAAAG^ 

AGAT C CAAAACTT CATG AAATCACTTCTG AGGCACCAATATATT ATTATGG'^ 

TTCAAATGATTTTATAG CAAAAGACAT CACT CGAACTGTTAATGGTT CTGACTTT AAGGT 

TTTCTATAACCAAGAAGAAATTGGTCAGTTT 

CTTAAATGCAACraCTGTTATTGCrAACCTT^ 

AGCTGAG CATTTGAAGACATTTTCAGGGGTAAAG CGT CGTTTTACTG AGAAGATTATTGA 
CGATACTGTCATTATTGATGACTTTGCTCACCAT^ 

TG CTG CT CGACAAAAATACCCGTCAAAAG AAATTGTAG CTATTTTCCAAC CG CATACGTT 
CACT CGT ACGAT AGCTCTTTTAGACGAATTTG CC CATG CCTTGAGT CAAGCGGATAGTO 
TTATCTCGCTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGA 
AGATTTAGCTG CTAAGATTGTCAAACACTCAGATTTAGTG 

TTTACTCAATCATGATAATG CTGTCTATGTCTTTATGGGTG CTGG AG ACATT CAATTGT A 
TGAG CGCTCTTTTG AAG AATTATTAG CTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4603 
STRAIN 090 

AAAGCAGGCTCTAGTGACGTTGACAAATATTATTTTACCCAACGTGGTTTAGAG 
GGTGTAACTATATTACCTT'TCI'CACCX3AATAATATCAGTGAGGATTTAGAGATTATTG^ 
GGAAATG CTTTT CGTCCA.GATAACAATGAAGAGTTGGCTTATGTT ATTG AAAA.GGGCTAT 
CATTTTAAA CG ATATCATG AATTT CTCGGAGATTTTATGCGT CAGTT CACTAGTCTAGGT 
GTAGCTGGGGCACATGGAAAAACerCAACGACAGGTTTATTA 
ATTACAGACACTTCTTTCCTAATTGGAGATGGTACAGG 

TACTTTGTGTTTGAAG CTG ATGAAT ACGAACGTCATTTT ATG C C GTACCATC CAGAATAC 
TCAATTATTACCAATATTGATTTTGAC CAT CCTGATTATTTTACAGG CCTAGAGGACGTA 
TTCAATGCTTTTAATGACTATGCTAAGCAAGTTCAAA 

GATT CAAAACTTCATGAAAT CACTTCTAAGG CACCAATATATTATTATGGTTTTGAAGAT 
TCAAATGATTTTATAGCAAAAGACATCACTCGAACTGTTAATG^ 

TTCTATAACCAAGAAGAAATTGGTCAGTTT CATGTACCAG CATACGGTAAACAT AAT AT C 
TTAAATG CAACTGCTGTTATTG CTAAC CTTTACATAATGGGAATTGATATGG CATTAGTA 
GCTGAG CATTTGAAG ACATTTT CAGGGGT AAAACGTCGTTTT ACTGAGAAGATT ATTGAC 
GATACIGTCATTATTGATGACTTTG CT CAC CATCCTACT 

GCTG CTCGACAAAAAT ACC CGT CAAAAGAAATTGT AG CTATTTTC CAAC CG CATACGTT C 
ACTCGTA03ATAGCrram?TAGACGATTTTGCCCATGCTTTGAG 

TATCTTG CT CAAAT ATATG GTT CT G CTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAA 
GATTTAGCTGCTAAGATTGTCAAACACTCAGATTTAGTC 
TTACTCAATCATCATAATGCTGTCTATGTCTTTATGGGTGC^ 
GAGCGCTCTTTTGAAGAATTATTAGCTAACCT 

SEQ ID NO. 4604 
STRAIN H36B 

AAAAG CAGGCT CTAGTgACGTTgACAAATAT t ATTTTACT CAACGTGGTT t AGAG CAAG CAGGT 
ATAACTATATTACCITTCT CAC CGAATAATAT CAGTGAGGATTTAGAGATTATTG CAGG A 
AATGCTTTTCGT CCAGATAACAATGAAGAGTTGG CTTATGTTATTGAAAAGGGCTAT CAT 
TTTAAACGATATCATGAATTTCTCGGAGATTTTATGCGTCAGTT 
GCTGGGGCA(^TC4GAAAAACCTCAACGACAGGTTTATTAG 

ACAGACACIT'CTIT COTAATTGGAGATGGTACAGGACGTGGTT CTG CTAATG CT 

TTTGTGTTTGAAGCTGATGAATACGAACGTCATTTTA 

ATTATTACCAATATTGATTTTGACCATCCTGATTA^ 

AATG CTTTTAATGACTATGCTAAG CAAGTTCAAAAAGGTTTATT CATTTATGGAGAAGAT 
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CCAAAACTTCATGAAATCACTTCTGAGGCACCAATATAl^ 

AATG ATT TT AT AG CAAAAG AT ATCACT CG AACTGTT AAT GGTT CT GACTTT AAG GTTTT C 

TATAACGAAGAAGAAATTGGTGA.GTTTCACGTACCAGCATACGGTAAACATAATATCTTA 

AATGCAACTGCTGTTATTGCTAACCTTTAGATAATGG 

GAGCATTTGAAGACATTTTCAGGGGTAAAACGTCGTTTTACTGAGAA 

ACTGTCATTATTGATGACTTTGCTCAC CAT CCTACTGAGATTATTG CGACATTAG ATGCT 

GCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCCAACCGCATACGTTCACT 

CXjTACGATAGCTCTTTTAGACGAAITTGCCCATC 

CT CG CTCAAATATATGGTT CTG CT AGAGAAGTAGATAATGGTGAGGTGA^GGTAGAAGAT 
TT AG CTG CTAAGATTGTCAAACACT CAGATTTAGT GACAGTCGAAAATGT CT CG C CTTTA 
CrCAATGATGATAATGCTGTCTATGTCTT^ 

CG CT CTTTTGAAGAATT ATTAG CTAACCT AACTAAAAATACACAA 

SEQ ID NO. 4605 
STRAIN 18RS21 

AAAGCAGG CT CTAGTGACGTTG ACAAATATTATTTTACC CAACGTGGTTT AG AG CAAG CA 
GGTGTAACTATATTACCTTTCTCACCGAATAATATCAGTGAGGATTTAGAGAT^ 
GGAAATG CTTTT CGT CCAGATAACAATGAAGAGTTGG CTT ATGTTATTGA 
CATTTTAAACGATATCATGAATTTCTC 
GTAGCTGGGGCACATGGAAAAACCTCAACGACa 

ATTACAGACACirCTTTCCT AATTGGAGATGGTAC^ CTAATG CTAAT 

TACTTTGTGTTTGAAGCTGATGAATACGAACGTCATTTT 

TCAATTATTACCAATATTGATTTTG ACCAT CCTGAT^ CTTAG AGGACGTA 
TT CAATG CCTrT AATGACTATG CTAAG CAAGTTCAAAAAGGTTT ATTCATTTATGGAGAA 
GATC CAAAACITCATGAAATCACTTCTGAGG CACCAATATATTATTATGGTTTTGAAGAT 
T CAAATGATTTTATAG CAA7VAGACATCACTCGAACTGTTAATGGTT CTGACTTTAAGGTT 
TTCT ATAAC CAAGAAGAAATTGGT CAGTTT CATGTACCAGCATACGGTAAACAT AATATC 
TTAAATG CAACTGCTGTTATTG CTAACCTT 1 ACATAATGGGAATTGATATGG CATTAGTA 
GCTGAGCATTTGAAGACGTTTTCAGGGGTAAAGCGTCGTTT^ 
GATACTGTCATTATTGATGACTTTGCTCACCATCCrACT 
GCTGCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCC^ 
ACTCGTACGATAGOTCTTTTAGACGAATTTG CCCATG C CTTG AGT CAAG CGGATAGCGTT 
TATCT CG CT CAAATATATGGTTCTG CTAGAGAAGTAGAT AATGGTGAGGTGAAGGTAGAA 
GATTTAGCTGCTAAGATTGTCAAACACTCAGAT^ 

TTACTCAATCATGATAATG CTGTCTATGT CTTTATGGGTG CTGG AGACATTCAATTGTAT 
GAGCG CT CITTTGAAGAATTATTAG CTAAC CTAACTAAAAATACACAA 

SEQ ID NO. 4606 
STRAIN M732 

AAAAGCAGG CTCTAGTGACGT t GACAAATA t TATTTTAC CCAACGTGGTTT AGAG CAAG CAG 
GTGTAACTATATTACCTTTCTCACCGAATAATATCAGTGAGGATTTAGAGATTATTC 
GAAATGCTTTT CGT CCAGATAACAATGAAG AGTTGGCT^^ CTATC 
ATTTTAAACGATATCATGAATTTOTCGGAGATTTTATG 

TAG CTGGGGCACATGGAA^yUVCCT CAACGACAC^TTTATTAG CT CATGTTTT AAAAAATA 
TTACAGACACTTCTTT C CTAATTGG AGATGGTACAC^ ACGTGGTTCTG CTAATG CTAATT 
ACTTTGTGTTTGAAG CTGATGAATACGAACGT CATTTTATG C CGT ACCAT CCAGAATACT 
CAATTATTACCAATATTGATTTTGACCATCCTGATTATTTT 
TCAATGCCTTTAATGACTATGCTAAGCAAGTTCAAAAAGGTC 

ATCCAAAACTT CATGAAATCACTTCTGAGG CAC CAATATATT ATTATGGT TTTGAAGATT 
CAAATGATTTTATAG CAAAACmCAT CACT CGAACTGTTAATGGTTCTGACT^ 
TCTATAACCAAGAAGAAATTGGTCAGTTTCATGTACC^GCA^ 
TAAATGCAACTG CTGTTATTG CTAACCTTTACATAA 

CTGAG CATTTGAAGACATTTTCAGGGGTAAAG CGTCGTTTTACTG AGAAGATTATTG ACG 
ATACTGTCATTATTGATGACTTTGCTCACCATCCTACT 

CTG CT CGACAAAAATAC CCGTCAAAAGAAATTGTAG CTATTTT CCAACCG CATACGTTCA 
CrOSTACGATAGCTCTTTTAGACGAATTT 

ATCTCGCTCA^TATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAgGTAGAAG 
ATTTAG CTG CTAAgATTGT CAAACACTCAGATTTAGTGACAGTCGAAAATGTCT CGC CTT 
TACT CAATCATGATAATGCTGT CTATGTCTTT ATGGGTG CTGGAGACATTCAATTGTATG 
AG CG CT CTTTTGAAGAATTATTAG CTAAC CTAACT AAAAATACACAA 

SEQ ID NO. 4607 
STRAIN M781 

AAAG CAGG CTCTAGTGACGT t GACAAATATT ATTTTAC C CAACGTGGTTTAG AGCAAGCAG 
GTGTAACTATATTACCTTTCTCACCGAATAATA 

GAAATGCTTTTCGT C CAGATAACAATGAAGAGTTGGCTTATGTTATTGAAAAGGG CTATC 
ATTTT AAACGATAT CATGAATTTCT CGGAGATTTTATGCGTCAGTT CACT AGTCTAGGT 
GTAGCIGC^CACATGGAAAAACCTCAACGACAGGTTTATTAGCT 

TATTACAGACACiTCTTT'C CTAATTGG AGATGGTACAGGACGTGGTTCTG CT AATGCTAA 

TT ACTTTGTGTTTGAAG CTGATGAATACGAACGTCATTTTATG CCGTAC CAT CCAGAATA 

CT CAATTATTACCAATATTG ATTTTGAC CATCCTGATTATTTTACAGGC CTAGAGGACGT 

ATTCAATGCCITTAATGACTATGCTAAGCAAGTTCAAAAAGG 

AGATCCAAAACTTCATGAAATCACTTCTGAGGCACCAATATATTAT^ 

TT CA7\ATGATTTTATAG CAAAAGACATCACT CGAACTGTTAATGGTT CTG ACTTTAAGGT 

TTTCTATAAC CAAG AAGAAATTGGT CAGTTT CATGTAC CAGCAT ACGGTAAACATAATAT 

CTTAAATGCAACTGCTTOTATTGCTAACCT^ 

AG CTG AG CATTTGAAGACATTTTCAGGGGTAAAG CGT CGTTTTACTG AG AAGATTATTGA 
CG ATACTGT CATTATTGATG ACTTTG CTCACCAT C CTACTGAGATTATTG CGACATTAGA 
TGCTGCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTT^ 

CACT CGTACGATAG CTCTTTTAGACGAATTTG CC CGGATAG CGT 

TTATCTCGCTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGA 
AGATTTAGCTGCTAAGATTGTCAAACACTCAGATTTAGTGACAGT 

TTTACTCAATCATGATAATG CTGTCTATGTCTTTATGGGTGCT CAATTGT A 
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TG AG C G CTCTTTTGAAG AATTATTAG CTAACCTAACTAA?\AATACACAA 

SEQ ID NO. 4608 
STRAIN CJB110 

AAAAAGCAGGCTCTAGTGACGTtGACAAATAtTATTTTACCCA^ 

GGTGTAACTATATTACCTTT CTCAC CGAATAATAT CAGTG AGGATTT AG AGATTATTG CA 

GG AAATGCTTTT CGT CCAG AT AACAATGAAGAGTTGG CTTATGTTATTGAAAAGGG CTAT 

CATTTTAAACGATATCATGAATTTCrCGGAGATTTTATGCGTCAGTTCACTAGTCTAGGT 

GTAGCTGGGGCACATGGAAAAACCTCAACGACAGGTTTATTAGCTCATGTIU'TAAAAAAT 

ATTACAGAC^CTTCTTTCCTAATTGGAGATGGTACAGGACGTGGTTCTG 

TACTTTGTGTTTGAAGCTGATGAATACGAACGTCATTTTATGCCGTACCATCCAGAATAC 

TCAAT TATTACCAAT ATTG ATTTTGAC CAT CGTG ATTATTTT ACAGG CCT AG AGG ACGT A 

TT CAATG CTTTTAATG ACTATG CTAAG CAAGTTGAAAAAGGTTTATTCATTTATGGAGAA 

GATTCAAAACTT CATGAAAT CACTTCTAAGGCACCAATATATTATTATGGCTTTGAAGAT 

TCAAATGATTTTATAGCAAAAGACATCACrCGAACTGTTAATGGTTCTGACT 

TT CT AT AACCAAGAAGAAATTGGT CAGTTTCATGTAC CAG CATACGGTAAACATAAT ATC 

TT AAATG CAACTGCTGTTATTG CTAAC CTTTACATAATG G GAATTGATATGG CATTAGTA 

G CTG AGCATTTGAAG ACATTTTCAGGGGTAAAACGT CG1T1T ACTGAGAAGATTATTGAC 

GATACTGTCATTATTGATGACITTGCTCACCAT^ 

GCTGCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCCAACCGCATACGTTC 
ACTCGTACGATAGCTCITTTAGACGATTTTGCCC^ 

TATCTTG CTCAAATATATGGTT CTG CTAGAGAAGT AGATAATGGTGAGGTGAAGGTAGAA 
GATTTAGCTG CTAAG ATTGT CAAACACTCAGATTT AGTGACAGT CGAAAATGTCTCG CCT 
TTACTCAAT(^TGATAATGCTGTCTATGTCTTTATGGGTGCTGGAGA 
GAGCG CT CTTTTGAAGAATTATTAG CTAAC CTAACTAAAAATACACAA 

SEQ ID NO. 4609 

STRAIN JM9130013 (reverse complement) 

GTTCAAAAAAGCAGGCTCTAGTGACGTTGACAAATATTAT^ 

GCAAGCAGGTATAACTATATTACCTTTCTCACCGAATAATATCAGTGA 

TATTGCAGGAAATGCTTTTCGTCCAGATAACAATGAAGAGTTG 

GGGCT AT CATTTTAAACGATATCAT GAATTT CT CGGAGATTTTATG CGT CAGTT CACTAG 
T CTAGGTGTAGCTGGGGCACATGGAAAAAC CT CAACX3ACAGGTTTATTAG CTCATGTTTT 
AAAAAATATTACAGACACCTCTCTCCTAATTG 

TG CTAATTACTTTGTGTTTGAAGCTG^TG AATACGAACGT CATTTTATG C CGT AC CATCC 
AG AATACT CAATTACTACCAAT ACTGATTTTGAC CATCCTGATTATTTTACAGG CCTAGA 
GGACGTATTCAATGCTTTTAATGACTATG CTAAG CAAGTT CAAAAAGGTTTATT CATTT A 
TG GAGAAGAT CCAAAA.CTT CATGAAAT CACTTCTGAGGCACCAATATATTATTATGGTTT 
TG AAG ATTCAAATG ATTTT ATAGCAAAAGAT AT CACT CGAACTGTTAATGGTTCTGACTT 
TAAGGTTTT CTATAACCAAGAAGAAATTGGTCAGTTT CACGTAC CAG CATACGGTAAACA 
T AATATCTT AAATGCAACTG CTGTTATTG CTAAC CTTTACATAATGGGAATTGATATGG C 
ATTAGTAGCTGAGCATTTGAAGACATTTTCAGGGGTAAAACGTCGT^ 
TACTGACGATACTGTCATT ACTGATGACTTTG CTCACCATCCTACTGAG A CGAC 
ATTAGATG CTG CTCGACAAAAATACCCGT CAAAAGAAATTGT AGCTATTTTC CAACCGCA 
TACGTTCACT CGTACGATAG CTCTTCTAGACGAATTTGC CCATG CCTTGAGT CAAGCGGA 
TAGCGTCTATCTCGCTCAAATATATGGCTCTGCTAGAGAAGTAGATAATGGTGAGGTGAA 
GGTAGAAGATTTAG CTG CTAAGATTGT CAAACACT CAGATTTAGTGACAGTCGAAAATGT 
CTCGC CTTT ACT CAATCATGAT AATGCTGTCTATGT CTTT ATGGGTG CTGGAGACATTCA 
ATTGTATGAGCX3CTCTTTTGAAGAACTACTAGCT 

SEQ ID NO. 4610 

STRAIN COH1 reverse complement 

CAGGCTCTAGTGACGTGACAAATATtATTTTACCCAACGTGGCTAGAGCAAGCAGG 

CTATATTACCITTCTCA.CCGAATAATATCA.GTGAGGATT^ 

CTTTTCX5TCCAGATAACAATGAAGAGTTGGCTTATGTTACTG 

AACGATATCATGAATCTCTCGGAGATTTTATG CGTCAGTT CACTAGTCTAGGTGTAGCTG 

GGGC^CATGGAAAAACCTCAACGACAGGTTTATTAGCTC^ 

ACACTTCTTTCCTAATTGGAGATGGTACAGGACGTGGTTCTG 

TGTTTGAAG CTGATGAATACGAACGTCATTCTATGCCGTACCATCCAGAATACT CAATT A 
TTAC CMTATTG ATTCTGACCATCCTGACTATCTTACAGGCCTAGAGGACGT ATT CAATG 
CCTTT AATGACTATG CTAAG CAAGTTCAJ\AAAGGTTTATT CATTTATGGAGAAGATC CAA 
AACTT CATGAAATCACTT CTGAGG CACCAATATATT ATT ATGGTTTTGAAGATT CAAATG 
ATTTTATAG CAAAAGACAT CACTCG AACTGCTAATGGTT CTGACTCTAAGGTTTT CTATA 
ACCAAGAAGAAACTGGTCAGTTTCATGTACCAGCATACGGTAAACATAATATCT^ 
CAACTGCTGTTATTGCTAACCTTTACATAATGGGAATTGATA 
ATCTGAAGACATTCTCAGGGGTAAAGCGTCGTTTTACT 
TCATTATTGATGACTCTGCTC^CCATCCTACTGAGATTATTGCGA 

GACAAAAATAC CCGT CAAAAGAAATTGTAG CTATTTT CCAAC CG CATACGTTCACT CGTA 
CGATAGCTCTTTTAGACGAATTTGCCCAT 

CTCAAATATATGGCT CTG CTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAAGATTTAG 
CTGCTAAGACTGTCAAACACTCAGATTTAGTGACAGTCGAAAA 

ATCATGATAATGCTGTCTATGTCTTTATGGGTGCTGGAGAC^CTCAACTGTATGAGCGCT 
<CTTTTGAAGAATTACTAGCTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4611 
STRAIN 2603 

atgtcaaaaacttatcattttattggtattaaaggatccggaatgagtgccctagcactg 
atgcttcatcaaatgggacataacgtccaaggaagtgacgttgacaaatattattttacc 
caacgtggtttagagcaagcaggtgtaactatattacctttctcaccgaataatatcagt 
gaggatttagagattattgcaggaaatgcttttcgtccagataacaatgaagagttggct 
tatgttattgaaaagggctatcaatttaaacgatatcatgaatttctcggagattttatg 
cgtcagttcactagtctaggtgtagctggggcacatggaaaaacctcaacgacaggttta 
ttagctcatgttttaaaaaatattacagacacttctttcctaattggagatggtacagga 
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cgtggttctgctaatgctaattactttgtgtttgaagctgatgaatacgaacgtcatttt 
atgccgtaccatccagaatactcaattattaccaatattgattttgaccatcctgattat 
tttacaggcttagaggacgtattcaatgcctttaatgactatgctaagcaagttcaaaaa 
ggtttattcatttatggagaagatccaaaacttcatgaaatcacttctgaggcaccaata 
tattattatggttttgaagattcaaatgattttatagcaaaagacatcactcgaactgtt 
aatggttctgactttaaggttttctataaccaagaagaaattggtcagtttcatgtacca 
gcatacggtaaacataatatcttaaatgcaactgctgttattgctaacctttacataatg 
ggaattgatatggcattagtagctgagcatttgaagacgttttcaggggtaaagcgtcgt 
tttactgagaagattattgacgatactgtcattattgatgactttgctcaccatcctact 
gagattattgcgacattagatgctgctcgacaaaaatacccgtcaaaagaaattgtagct 
attttccaaccgcatacgttcactcgtacgatagctcttttagacgaatttgcccatgcc 
ttgagtcaagcggatagcgtttatctcgctcaaatatatggttctgctagagaagtagat 
aatggtgaggtgaaggtagaagatttagctgctaagattgtcaaacactcagatttagtg 
acagtcgaaaatgtctcgcctttactcaatcatgataatgctgtctatgtctttatgggt 
gctggagacattcaattgtatgagcgctcttttgaagaattattagctaacctaactaaa 
aatacacaa 

SEQ ID NO. 4612 

STRAIN COH1 reverse complement 

CAGG CT CTAGTGACGT t GACAAAT A t TATTTT AC C CAACGTGG t TTAGAG CAAGCAGGTGTAA 
CTATATTACCTTTCTCACCGAATAATAT 

CTTTT CGTC CAG ATAACAATGAAG AGTTGG CTTATGTTATTG AAAAGGG CTATCATTTTA 
AACGATATCATG AATTT CT CGGAGATTTTATG CGT CAGTT CACTAGTCTAGGTGTAGCTG 
GGGCACATGGAAAAACCTCAACGACAGGTTTATTAGCT 

ACACTT CU'IU' CCTAATTGGAGATGGTACAGGACGTGGTT CTG CTAATGCTAATTACTTT G 
TGTTTGAAG CTGATG AATACGAACGTCATTTTATG CCGTACCAT C CAGAATACT CAATTA 
TTAC CAATATTGA1' l"i"l'GACCATC CTGATTATTTTACAGG CCTAG AGGACGTATTCAATG 
CCTTTAATG ACTATG CTAAG CAAGTTCAAAAAGGTTTATT (^TTTATGG AGAAGATCCAA 
AACTTCATG AAATCACTTCTGAGG CAC CAATATATTATTATGGTTTTGAAGATTCAAATG 
ATTTTATAG CAAAAGACAT CACT CGAACTGTT AATGGTT CTGACTTTAAGGTTTT CT ATA 
AC CAAGAAG AAATTGGT CAGTTTCATGTACCAG CATACGGTAAACATAATATCTTAAATG 
CAACTGCTGTTATTGCTAACCTTTACATAATGGGAATT 

ATTTGAAGACATTTT CAGGGGTAAAG CGT CGTTTTACHX3AGAAGATT ATTGACGATACTG 

TCATTATTGATGACITTGCTCACCATCCrACTGAG 

GACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCCA^^ 

CGATAGCTCTTTTAGACGAATTTG C CCATG CCTTGAGTCAAGCGG ATAG CGTTTATCTCG 
CTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAAGATTTAG 
CTGCTAAGATTG'TCAAACACT CAGATTTAGTGACAGT CGAAAATGTCTCG CCTTTACTCA 
ATCATGATAATG CTGTCTATGTCTTTATGGGT G CTGG AGAC^TTCAATTGTATG AG CGCT 
CTTTTGAAGAATTATTAG CT AACCTAACTAAAAATACACAA 
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100 

-aaagcaggc tctagtgacg 

A Aaaagcaggc tctagtgacg 

~ Aaaagcaggc tctagtgacg 

-GttcaA Aaaagcaggc tctagtgacg 
. Aaaagcaggc tctagtgacg 



caggc tctagtgacg 

Aaaagcaggc tctagtgacg 
-aaagcaggc tctagtgacg 
-aaagcaggc tctagtgacg 
cctagcactg atgcttcatc aaatGggacA taacgtccaa ggaagtgacg 



********** ********** ********** *_ 



msa2 53045.2{l57__090j 
msa253045.2{l57_CJB110) 
msa253045.2{l57_H36B) 
msa253045.2{l57_JM9130013) 
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msa253045 .2 
msa253045 .2 
msa253045.2 
msa253045 .2 



157_A909 
157_COHlj 
157_M732] 
157_M78l} 



msa253 045 . 2 { 157_1BRS21 } 



101 

t TGACAAAT A 
tTGACAAATA 
t TGACAAATA 
tTGACAAATA 
tTGACAAATA 
-TGACAAATA 
tTGACAAATA 
tTGACAAATA 
tTGACAAATA 
tTGACAAATA 



TTATTTTACc 
TTATTTTACc 
TTATTTTACt 
TTATTTTACt 
TTATTTTACc 
TTATTTTACc 
TTATTTTACc 
TTATTTTACc 
TTATTTTACc 
TTATTTTACc 



CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 
CAACGTGGTT 



TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 
TAGAGCAAGC 



150 

AGGTgTAACT 
AGGTgTAACT 
AGGTaTAACT 
AGGTaTAACT 
AGGTgTAACT 
AGGTgTAACT 
AGGTgTAACT 
AGGTgTAACT 
AGGTgTAACT 
AGGTgTAACT 



801 



WO 2004/018646 



PCT/US2003/026827 



Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 

msa253045.2{l57__2603} t TGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT 
Consensus _********* *********_ ********** ********** ****_***** 



msa253045 
msa253045.2{ 

msa253045 . 
msa253045.2{l57 
msa253045.2{ 

msa253045 . 

msa253045. 

msa2S3045 . 

msa253045 . 
msa253045.2{ 

msa253045. 



2{l57_090> 
157_CJB110} 
2{l57_H36B} 
_JM9130013} 
157_1169NT} 
2{157_A909} 
2{l57_COHl} 
2{157_M732} 
2{l57_M78l} 
157_18RS2l} 
2{157_2603} 
Consensus 



151 

ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
ATATTACCTT 
********** 



TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
TCTCACCGAA 
********** 



TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
TAATATCAGT 
********** 



GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
GAGGATTTAG 
********** 



200 

AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
AGATTATTGC 
********** 



msa253045 
msa253045.2{ 

msa253045 . 
msa253045.2{l57 
msa253045.2{' 

msa253045 . 

msa253045 . 

msa253045 . 

msa253045 . 
msa253045 .2{ 

msa253045 



2{157_090} 
157_CJB110} 
2{l57_H36B} 
_JM9130013} 
157_1169NT} 
2f 157_A909 
2{l57_COHl' 
2{157_M732; 
2{157__M781" 
157__18RS21 < 
2{157_2603] 
Consensus 



201 

AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
AGGAAATGCT 
********** 



TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 
TTTCGTCCAG 



ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 
ATAACAATGA 



********** ********** 



AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
AGAGTTGGCT 
********** 



250 

TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
TATGTTATTG 
********** 



msa253 045.2{l57_090} 
msa253045 . 2 { 157_CJB110 } 
msa253045 . 2 { 157JH36B} 
msa253045 . 2 {l57_CTM9130013 } 
msa253045.2{l57_1169NT} 
rasa253045.2{l57_A909} 
msa253045 . 2 I 157_C0H1 } 
msa253045 . 2 { 157_M732 } 
msa253 045 . 2 { 157_M781 } 
msa253045 . 2 { 157_18RS2l} 
msa253045 . 2 { 157_2603 } 
Consensus 



251 

AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 
AAAAGGGCTA 



TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAtTTTAAA 
TCAaTTTAAA 



CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 
CGATATCATG 



********** ***_****** ********** 



AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
AATTTCTCGG 
********** 



300 

AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
AGATTTTATG 
********** 



301 

msa253045.2{l57_090} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_CJB110} CGTCAGTTCA CTAGTCTAGG 

msa253 045.2{l57_H36B} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_JM9130013} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_1169NT} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_A909} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{!57_COHl} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_M732} CGTCAGTTCA CTAGTCTAGG 

msa2 53045.2{l57_M78ll CGTCAGTTCA CTAGTCTAGG 

msa253045.2{l57_18RS2l} CGTCAGTTCA CTAGTCTAGG 

msa253045.2{157_2603} CGTCAGTTCA CTAGTCTAGG 

Consensus ********** ********** 



TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 
TGTAGCTGGG 



GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 
GCACATGGAA 



********** ********** 



350 

AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
AAACCTCAAC 
********** 



msa253045 
msa253045.2{ 

msa253045. 
msa253045.2{l57 
msa253045.2{' 

msa253045 . 

msa253045 . 

msa253045. 

msa253045 . 
msa253045.2{ 

msa253045 



2{157_090} 
157_CJB110} 
2{157_H36B} 

JM9130013} 
157_1169NT} 
2{157_A909} 
2(157_C0H1} 
2{157_M732} 
2{157_M781} 
157_18RS21} 
2{157_2603} 
Consensus 



351 

GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
GACAGGTTTA 
********** 



TTAG CTCATG 
TTAGCTCATG 
TTAG CTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
TTAGCTCATG 
********** 



TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
TTTTAAAAAA 
********** 



TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
TATTACAGAC 
********** 



400 

ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
ACTTCTTTCC 
********** 



msa253045 
msa253045.2{ 
msa253045 . 
msa253045.2{!57 
rasa253045.2{ 
msa253045. 
msa253045 . 
msa253045 . 
msa253045 . 



2{157_090} 
157_CJB110} 
2{157_H36B} 

JM9130013} 
157_1169NT} 
2{157_A909} 
2{157_C0H1} 
2{157__M732} 
2{157_M781} 



401 

TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 
TAATTGGAGA 



TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 
TGGTACAGGA 



CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 
CGTGGTTCTG 



CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 
CTAATGCTAA 



450 

TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 
TTACTTTGTG 



802 



WO 2004/018646 



PCT/US2003/026827 



Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



msa253 045 . 2 { 157_18RS21 } 
msa253045 .2{l57_2603} 
Consensus 



TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG 
TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG 
********** ********** ********** ********** ********** 



msa253045 
msa253045.2{ 

msa253045 . 
msa253045.2{l57 
msa253045.2{ 

msa253045. 

msa253045 . 

msa253045 . 

msa253045 . 
msa253045.2{ 

msa253045 



.2 {157J390} 
157_CJB110) 
2{157_H36B} 
JM9130013) 
157_1169NT) 
2{l57_A909} 
2{157_C0H1} 
2{157_M732> 
2{157_M781} 
157_18RS21} 
2{157_2603) 
Consensus 



451 

TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
TTTGAAGCTG 
********** 



ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
ATGAATACGA 
********** 



ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
ACGTCATTTT 
********** 



ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
ATGCCGTACC 
********** 



500 

ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
ATCCAGAATA 
********** 



msa253045 
msa253045.2{ 

msa253045. 
msa253045.2{!57 
msa253045.2{ 

msa253045 . 

msa253045 . 

msa253045 . 

msa253045 . 
msa253045.2'{ 

msa253045 . 



.2{157_090} 
157__CJB110} 
2{157_H36B) 
JM9130013} 
157_1169NT) 
2{157_A909) 
2{157_C0H1} 
2f 157_M732} 
2{157_M781} 
157_18RS2l} 
2{157_2603} 
Consensus 



501 

CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
CTCAATTATT 
********** 



ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
ACCAATATTG 
********** 



ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
ATTTTGACCA 
********** 



TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
TCCTGATTAT 
********** 



550 

TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCC 
TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCc 
TTTACAGGCt 
TTTACAGGCt 
*********_ 



551 600 

msa253045 .2{157_090} TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_CJB110.} TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_H36Bl TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_JM9130013) TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045 . 2 { 157_1169NT} TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_A909} TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045. 2{157_C0H1} TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_M732} TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa2 53045.2(15 7_M7 8 1 } TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045 .2 { 157_18RS2l) TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

msa253045.2{l57_2603) TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA 

Consensus ********** *********- ********** ********** ********** 

601 650 

msa253045.2{l57_090} GGTTTATTCA TTTATGGAGA AGATtCAAAA CTTCATGAAA TCACTTCTaA 

msa253 045 . 2 { 157_CJB110 } GGTTTATTCA TTTATGGAGA AGATtCAAAA CTTCATGAAA TCACTTCTaA 

msa253045.2{l57_H36B} GGTTTATTCA TTTATGGAGA AGATc CAAAA CTTCATGAAA TCACTTCTgA 

msa253045.2{l57_JM9130013} GGTTTATTCA TTTATGGAGA AG AT C CAAAA CTTCATGAAA TCACTTCTgA 

msa253045 .2 { 157_1169NT} GGTTTATTCA TTTATGGAGA AG AT c CAAAA CTTCATGAAA TCACTTCTgA 

msa253045.2{l57_A909) GGTTTATTCA TTTATGGAGA AGATc CAAAA CTTCATGAAA TCACTTCTgA 

msa253045.2{l57_COHl} GGTTTATTCA TTTATGGAGA AGAT c CAAAA CTTCATGAAA TCACTTCTgA 

msa253045 . 2 (157_M732 } GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA 

msa253045.2{l57_M781) GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA 

msa253045.2{l57_18RS2l} GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA 

msa253045 . 2 (157_2603 } GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA 

Consensus ********** ********** ****_***** ********** ********_* 



msa253045 
msa253045 .2{ 
msa253045 
msa253045.2{l57 
msa253045.2{ 
msa253045. 
msa253045 
msa253045 
msa253045. 
msa253045.2{ 
msa253045. 



.2{l57_090j 
157_CJB110 
2{157_H36B] 
JM9130013; 
157JL169NT 
2{157_A909] 
2{l57__COHl 
2{157_M732] 
2{157_M781) 
157_18RS2l} 
2 { 157_2603} 
Consensus 



651 

GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
GGCACCAATA 
********** 



TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
TATTATTATG 
********** 



GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
GTTTTGAAGA 
********** 



TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 
TTCAAATGAT 



700 

TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 
TTTATAGCAA 



********** ********** 



701 750 

msa253045 .2 {l57_090} AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045.2{l57_CJB110} AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045.2{l57_H36B> AAGAtATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045 .2 {157_JM9 130013} AAGAtATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045.2{l57_1169NT} AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045 . 2 {l57_A909} AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045.2{l57_COHl] AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 

msa253045.2{l57JM732) AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 



803 



WO 2004/018646 



PCT/US2003/026827 



Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



msa253045 . 2 { 157_M781 
msa253 045 . 2 { 157_18RS21 
msa253045.2{l57_2S03 
Consensus 



AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 
AAGACATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 
AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC 
****_***** ********** ********** ********** ********** 



msa253045 
msa253045.2{ 
msa253045 
msa253045.2{l57 
msa253045.2{ 
msa253045 
msa253045 
msa253045 
msa253045 . 
msa253045.2{ 
msa25304S. 



•2{157_090} 
157_CJB110} 
2{157_H36B} 
_JM9130013} 
157_1169NT} 
2{l57_A909} 
2{l57_COHl} 
2{157_M732} 
2{l57_M7Sl} 
157__18RS2l} 
2{157_2603J 
Consensus 



751 

CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
CAAGAAGAAA 
********** 



TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 
TTGGTCAGTT 



TCAt GTACCA 
TCACGTACCA 
TCAcGTACCA 
TCAcGTACCA 
TCAt GTACCA 
TCAtGTACCA 
TCAtGTACCA 
TCAtGTACCA 
TCAtGTACCA 
TCAtGTACCA 
TCAtGTACCA 



GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 
GCATACGGTA 



800 

AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 



********** ***-****** ********** ********** 



msa253045 
msa253045.2{ 

msa253045 
msa2 53 045-2(157 
msa253045.2{ 

msa253045 . 

maa253045 . 

msa253045 . 

msa253045. 
msa253 045.2{ 

msa253045 . 



2 {157^090} 
157_CJB110} 
2{l57_H36B> 
JM9130013) 
157 1169NT} 



157_A909] 
157__COHl] 
157_M732l 
157_M781 
157_18RS21j 
2{157_2603} 
Consensus 



801 

CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
CTTAAATGCA 
********** 



ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 
ACTGCTGTTA 



TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 
TTGCTAACCT 



********** ********** 



TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
TTACATAATG 
********** 



850 

GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
GGAATTGATA 
********** 



851 900 

msa253045.2{l57_090} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT 

msa253045.2{l57_CJB110j TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT 

msa253045.2{l57_H36B) TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT 

msa253045.2{l57_JM9130013} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT 

ntsa253045.2{l57_1169NT} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT 

msa253045.2{l57_A909) TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT 

msa253045.2{157_COHl} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT 

msa253045 -2{157_M732} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT 

msa253045.2{l57_M78l} TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT 

msa253045.2{l57_18RS2l} TGGCATTAGT AGCTGAGCAT TTGAAGACgT TTTCAGGGGT AAAgCGTCGT 

msa253045 .2{157_2603} TGGCATTAGT AGCTGAGCAT TTGAAGACgT TTTCAGGGGT AAAgCGTCGT 

Consensus ********** ********** ********_* ********** ***_****** 



901 

msa253045.2{l57_090) TTTACTGAGA 

msa253045.2{l57_CJB110} TTTACTGAGA 

msa253045.2{l57_H36B} TTTACTGAGA 

msa253045.2{l57_JM9130013} TTTACTGAGA 

msa253045.2{l57_1169NT} TTTACTGAGA 

msa253045 .2{157_A909} TTTACTGAGA 

msa253045 . 2 [ 157_COHl \ TTTACTGAGA 

msa253045.2{157_M732} TTTACTGAGA 

msa253045-2{l57_M781} TTTACTGAGA 

msa253045.2{l57_18RS2lJ TTTACTGAGA 

msa253045.2{l57_2603} TTTACTGAGA 

Consensus ********** 



AgATTATTGA 
AgATTATTGA 
AaATTATTGA 
AaATTATTGA 
AgATTATTGA 
AgATTATTGA 
AgATTATTGA 
AgATTATTGA 
AgATTATTGA 
AgATTATTGA 
AgATTATTGA 



CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 
CGATACTGTC 



ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 
ATTATTGATG 



*~******** ********** ********** 



950 

ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
ACTTTGCTCA 
********** 



msa253045 . 2 { 157_090 ) 
msa253045.2{l57_CJB110} 
«\sa253045 . 2 { 157JH36B} 
msa253045.2{l57_JM9130013j 
msa253045.2{l57__1169NT} 
msa253045 . 2/l57_A909} 
msa253045 . 2 { 157_C0H1 ' 
msa253045 . 2 { 157_M732 
maa253045.2{l57_M781 
msa253045 . 2 { 157_18RS21} 
msa253045 . 2 { 157_2603 } 
Consensus 



951 

CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
CCATCCTACT 
********** 



GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
GAGATTATTG 
********** 



CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 
CGACATTAGA 



TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 
TGCTGCTCGA 



********** ********** 



1000 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
CAAAAATACC 
********** 



1001 1050 

msa253045.2{l57__090} CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

msa253045 .2 {157_CJB110) CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

msa253045.2{l57_H36B) CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

msa253045.2{l57_JM9130013 \ CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

msa253045.2{l57_JL169NTj CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

tcisa253045,2f 157__A909) CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 

ms a2 53045. 2 { 157_C0H1 } CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 



804 
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Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



tnsa253045 . 2 { 157J4732 J- 
msa253045 . 2 { 157_M78 1 } 
msa253 045 . 2 { 157_18RS2 1 } 
msa253045.2{l57_2603} 
Consensus 



CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 
CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 
CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 
CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG 
********** ********** ********** ********** ********** 



1051 1100 

msa253045.2{l57_090} ATAGCTCTTT TAGACGAtTT TGCCCATGCt TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57_CJB110V ATAGCTCTTT TAGACGAtTT TGCCCATGCt TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57_H36B} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

ntsa253045.2{l57_JM9130013} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57_1169NT} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

ma a2 53 045 . 2{l57_A909} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

• msa253045,2{157_COHl} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57_M732} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

tnsa253045.2{l57_M78lj ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57__18RS2l} • ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

msa253045.2{l57_2603} ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT 

Consensus ********** *******_** *********_ ********** ********** 



msa253045 
msa253045.2{ 

msa253045 
msa253045.2{l57 > 
msa253 045 .2 { 

msa253045. 

msa253045 . 

msa253045. 

msa253045 . 
msa253045.2{ 

msa253045 . 



.2{157_090 
157_CJB110' 
2{l57__H36B] 

JM9130013 
l'57_1169NT) 
2{157__A909] 
2{l57_COHl] 
2fl57__M732 ( 
2{l57_M78l' 
157_18RS2l} 
2 { 157_2603} 
Consensus 



1101 

TTATCT t GCT 
TTATCT t GCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
TTATCTcGCT 
******_ *** 



CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
CAAATATATG 
********** 



GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
GTTCTGCTAG 
********** 



AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
AGAAGTAGAT 
********** 



1150 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
AATGGTGAGG 
********** 



msa253045 
msa253045.2{ 

msa253045 
msa253045.2{l57 
msa253045.2{ 

msa253045 . 

msa253045 . 

msa253045 . 

msa253045 . 
msa253045.2{ 

rasa253045 . 



.2{157_090] 
157_CJB110 
2{157_H36B] 
JM9130013* 
157_1169NT) 
2{157_A909} 
2{l57_COHl} 
2{157_M732} 
2{157_M781} 
157_18RS2l} 
2{157_2603} 
Consensus 



1151 

TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
TGAAGGTAGA 
********** 



AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
AGATTTAGCT 
********** 



GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
GCTAAGATTG 
********** 



TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
TCAAACACTC 
********** 



1200 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
AGATTTAGTG 
********** 



msa253045 
msa253045.2{ 

n\sa253045. 
msa253045.2{l57 
msa253045.2{' 

msa253045. 

msa253045. 

msa253045 . 

msa253045 . 
msa253045.2{ 

msa253045. 



2{157_090] 
157_CJB110| 
2{157_H36B] 
JM9130013; 
157_1169NT] 
2{157_A909] 
21157 COHl' 
2{157_M732; 
2{157_M781] 
157_18RS21 1 
2{157__2603] 
Consensus 



1201 

ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
ACAGTCGAAA 
********** 



ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 
ATGTCTCGCC 



TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 
TTTACTCAAT 



********** ********** 



CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
CATGATAATG 
********** 



1250 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
CTGTCTATGT 
********** 



msa253045 
msa253045.2{ 
. msa253045. 
msa253045.2{l57 
msa253045.2{ 
msa253045 . 
msa253045 . 
msa253045 . 
msa253045 
msa253045.2{ 
msa253045 . 



.2{157_090' 
157__CJB11.0 
2{157JH36B 
JM9130013 t 
157_1169NT 
2{157_A909 
2fl57_COHl 1 
2{157__M732| 
2{l57_M78i: 
157_18RS2l] 
2 {157^2603, 
Consensus 



1251 

CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 
CTTTATGGGT 

CTTTATGGGT 
********** 



GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
GCTGGAGACA 
********** 



TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 
TTCAATTGTA 



TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 
TGAGCGCTCT 



********** ********** 



1300 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
TTTGAAGAAT 
********** 



msa253045 .2{157_090 J 
msa253045 . 2 { 157_CJB110 } 
msa253045.2{l57_H36B} 
msa253045 . 2 { 157_JM9130013 } 
msa253045.2{l57_1169NT) 
msa253045.2{l57_A909} 



1301 

TATTAGCTAA 
TATTAGCTAA 
TATTAGCTAA 
TATTAGCTAA 
TATTAGCTAA 
TATTAGCTAA 



CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 



1329 
AATACACAA 
AATACACAA 
AATACACAA 
AATACACAA 
AATACACAA 
AATACACAA 



805 



WO 2004/018646 PCT/US2003/026827 
Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



msa253045 . 2 { 157_COHl } 
msa253045 . 2 { 157_M732 } 
msa253045 .2{l57__M78l} 
msa253045 . 2 { 157_18RS21 } 
msa25304S .2{157_2603} 
Consensus 



T ATT AG CT AA 
TATTAGCTAA 
TATTAG CTAA 
TATTAGCTAA 
TATTAGCTAA 



CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 
CCTAACTAAA 



********** ********** 



AATACACAA 
AATACACAA 
AATACACAA 
AATACACAA 
AATACACAA 
********* 



SKQ ID NO. 4613 
STRAIN A909 frame: 2 

DKYYFTQRGLEQAGVTI LPFSPNNI SEDLEI I AGNAFRPDNNEELAYVI EKGYHFKRYHE 
FLGD FMRQFTSLGVAGAHGKTSTTGLLAHVLKNI tdts fli gdgtgrgsananyf vfead 
EYERHFMPYHPEYS I ITNI D FDHPDYFTGLEDVFNAFND YAKQVQKGLF I YGEDPKLHEI 
TSEAP I YYYGFEDSNDFI AKDITRTVNGSDFKVFYNQEE IGQFHVPAYGKHNILNATAVI 
ANLYIMGIDMALVAEHLKTFSGVKRRFTEKI IDDTVI I DDFAHHPTE 1 I ATLDAARQKYP 
SKEI VAI FQPHTFTRT I ALLDE FAHALSQADS VYLAQ I YGSARE VDNGE VKVEDLAAKI V 
KHSDLVTVENVS PLLNHDNAVYVFMGAGD I QL YERS FEELLANLTKNTQ 

SEQ ID NO. 4614 
STRAIN 1169NT frame: 2 

KAGS SDVDKYYFTQRGLEQAGVT I LPFSPNNI SEDLEI I AGNAFRPDNNEELAYVI EKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLIiAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYS 1 1 TNI DFDHPDYFTGLEDVFNAFND YAKQVQKGLF I YGE 
DPKLHEITSEAP I YYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEI GQFHVPAYGKHNI 
LNATAVI ANLY I MG I DMAL VAEHLKTF SG VKRRFTE K I IDDTVI IDDFAHHPTEI IATLD 
AARQKYPS KE I VAI FQPHTFTRTIALLDEFAHALSQADS VYLAQ I YGSAREVDNGEVKVE 
DLAAKI VKHSDLVTVENVS PLLNHDNAVYVFMGAGD I QLYERSFEELLANLTKNTQ 

SEQ ID NO. 4615 
STRAIN 090 FRAME: 1 

KAGS SDVDKYYFTQRGLEQAGVTI LPFSPNNI SEDLEI I AGNAFRPDNNE ELAYV I E KG Y 
HFKRYHEFLGDF^QFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPE YS 1 1 TNI DFDHPDYFTGLEDVFNAFND YAKQVQKGLF I YGE 
DS KLHE ITS KAP I YYYGFEDSNDF I AKD I TRTVNGSDFKVFYNQEE I GQFHVPAYGKHN I 
LNATAVI ANLY I MG I DMAL VAE HL KTF S G VKRRFTEK I IDDTVI I DDFAHHPTE I IATLD 
AARQKYPSKE I VAI FQPHTFTRTIALLDDFAHALSQADS VYLAQ I YGSAREVDNGEVKVE 
DLAAKI VKHSDLVTVENVSPLLNHDNAVYVFMGAGD I QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4616 
STRAIN H36B frame: 2 

KAGS SDVDKY YFTQRGLEQAGI T I LPFSPNNI SEDLE 1 1 AGNAFRPDNNEELAYVI EKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTS FL I GDGTGRGSANAN 
YFVFEADEYERHFMP YHPEYS 1 1 TNI DFDHPDYFTGLEDVFNAFND YAKQVQKGLF I YGE 
DPKLHEITSEAP I YYYGFEDSNDF I AKD ITRTVNGSDFKVFYNQEE I GQFHVPAYGKHN I 
LNATAVIANLYIMG I DMALVAEHLKTFSGVKRRFTEKI IDDTVI IDDFAHHPTEI IATLD 
AARQKYPSKE I VAI FQPHTFTRTIALLDEFAHALSQADSVYLAQ I YGSAREVDNGEVKVE 
DLAAKI VKHSDLVTVENVS PLLNHDNAVYVFMGAGD I QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4617 
STRAIN 18RS21 frame: 1 

KAGS SDVDKYYFTQRGLEQAGVT I LPFSPNNI SEDLE 1 1 AGNAFRPDNNEELAYVI EKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLIiAHVLKN ITDTSFL I GDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYS I ITNI DFDHPDYFTGLED VFNAFND YAKQVQKGLF I YGE 
DPKLHEITSEAP I YYYGFEDSNDF I AKDITRTVNGSDFKVFYNQEE I GQFHVPAYGKHN I 
LNATAVIANLYIMGI DMALVAEHLKTFSGVKRRFTEKI IDDTVI IDDFAHHPTEI IATLD 
AARQKYPSKE I VAI FQPHTFTRT I ALLDEFAHALSQADSVYLAQI YGSAREVDNGEVKVE 
DLAAKI VKHSDLVTVENVS PLLNHDNAVYVFMGAGD IQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4618 ' 
STRAIN M732 frame: 2 

KAGS SDVDKYYFTQRGLEQAGVT I LPFS PNNI SEDLE 1 1 AGNAFRPDNNEELAYVI EKGY 
HFKRYHEFLGDFMRQ FT SLGVAGAHGKTSTTGLLAHVLKN I TDTS FLI GDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYS I ITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFI YGE 
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMG I DMALVAEHLKTFSGVKRRFTEKI IDDTVI IDDFAHHPTEI IATLD 
AARQKYPS KE I VAI FQPHT FTRT I ALLDEFAHALSQADS VYLAQ I YGSAREVDNGEVKVE 
DLAAKI VKHSDLVTVENVS PLiLNHDNAVYVFMGAGD I QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4619 

STRAIN JM9130013 frame: 2 

FKKAGSSDVDKYYFTQRGLEQAGIT I LPFS PNNI SEDLE 1 1 AGNAFRPDNNEELAYVI EK 
GYHFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSAN 
ANYFVFEADEYERHFMPYHPEYSI ITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIY 
GEDPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKH 
NI LNATAVI ANLY IMG I DMALVAEHLKTFS GVKRRFTEKI IDDTVI IDDFAHHPTEI IAT 
LDAARQKYP SKEI VAI FQPHTFTRT I ALLDEFAHALSQADS VYLAQ I YGSAREVDNGEVK 
VEDIAAKI VKHSDLVTVENVSPLIiNHDNAVYVFMGAGDI QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4620 
STRAIN M781 frame: 1 

KAGS SDVDKYYFTQRGLEQAGVT I LPFS PNNI SEDLE 1 1 AGNAFRPDNNEELAYVI EKGY 
HFKRYHEFLGDFMRQ FT SLGVAGAHGKTSTTGLLAHVLKN I TDTS FLI GDGTGRGSANAN 



806 
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PCT/US2003/026827 



Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DPKLHEITSEAP I YYYGFEDSNDF I AKD I TRTVNGSDFKVFYNQEE I GQFHVPAYGKHNI 
LNATAVI ANL Y I MG I DMALVAEHLKT FSG VKRRFTE KI IDDTVI IDDFAHHPTEI IATLD 
AARQKYPSKEIVAI FQPHTFTRTIALLDEFAHALSQADSVYLAQI YGSAREVTJNGEVKVE 
DLAAKI VKHSDL VTVENVS PLLNHDNAVYVFMGAGD I QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4621 
STRAIN GTB110 frame: 3 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNI SEDLE 1 1 AGNAFRPDNNE ELAYV I EKGY 
HFKRYHE FLGDFMRQFTS LG VAGAHGKTSTTGLIjAHVLKN ITDTS FL I GDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSI ITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DSKLHE I TSKAP I YYYGFEDSNDF X AKDITRTVNGSDFKVFYNQEE IGQFHVPAYGKHNI 
LNATAVI ANLYIMGI DMALVAEHLKTFSGVKRRFTEKI IDDTVI IDDFAHHPTEI IATLD 
AARQKYPSKEIVAI FQPHTFTRTI ALLDDFAHALSQADSVYLAQ I YGSAREVDNGEVKVE 
DLAAKI VKHSDL VTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4622 
STRAIN 2603 frame: 1 

MSKTYHF IG I KGSGMSALALMLHQMGHNVQGSDVDKYYFTQRGLEQAGVTILPFSPNNI S 
EDLE 1 1 AGNAFRPDNNE ELAYVI EKGYQFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGL 
LAHVLKNITDTSFLIGDGTGRGSANANYFVFEADEYERHFMPYHPEYSIITNIDFDHPDY 
FTGLEDVFNAFNDYAKQVQKGLF I YGEDPKLHE I T SEAP I YYYGFEDSNDFI AKD I TRTV 
NGSDFKVFYNQEE I GQFHVPAYGKHNI LNATAVI ANLYI MGI DMALVAEHLKTFSGVKRR 
FTEKI IDDTVI IDDFAHHPTEI I ATLDAARQKYPS KE I VAI FQPHTFTRTI ALLDEFAHA 
LSQADSVYLAQI YGS AREVDNGEVKVEDLAAKI VKHSDLVTVENVS PLLNHDNAVYVFMG 
AGDI QLYERS FEELLANLTKNTQ 

SEQ ID NO. 4623 
STRAIN COH1 frame: 3 

GSSDVDKYYFTQRGLEQAGVTI LPFSPNNI SEDLEI IAGNAFRPDNNEELAYVIEKGYHF 
KRYHEFIX3DFMRQFTSLGVAGAHGK^STTGLI J AHVLKNITDTSFLIGDGTGRGSANANYF 
VFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGEDP 
KLHE ITSEAPI YYYGFEDSNDFI AKDI TRTVNGSDFKVFYNQEEI GQFHVPAYGKHNILN 
ATAVIANLY IMG IDMALVAEHLKTFSGVKRRFTEKI IDDTVI IDDFAHHPTEI IATLDAA 
RQKYPSKEIVAI FQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVEDD 
AAKI VKHSDL VTVENVSPLLNHDNAVYVFMGAGD I QLYERS FEELLANLTKNTQ 

PRETTY of: /biotmp/msa56635 . 2 { *} November 26, 2002 08:08 



1 50 

msa253220 -2{157_090} kag ssdvDKYYFT QRGLEQAGvT 

msa253220 .2 {157_CJB110} kag ssdvDKYYFT QRGLEQAGvT 

msa253220.2{l57_1169NT} kag ssdvDKYYFT QRGLEQAGvT 

msa253220.2{l57_18RS2l) kag ssdvDKYYFT QRGLEQAGvT 

msa253220 . 2 { 1S7_M732) kag ssdvDKYYFT QRGLEQAGvT 

msa253220.2{lS7_M78l} r — kag ssdvDKYYFT QRGLEQAGvT 

msa253220.2{l57_COHlj g ssdvDKYYFT QRGLEQAGvT 

msa253220.2{l57_H36B) kag ssdvDKYYFT QRGLEQAGiT 

msa253220 . 2 { 157_JM9130013 } fkkag ssdvDKYYFT QRGLEQAGiT 

msa253220 .2{157_2603} msktyhfigi kgsgmsalal mlhqmghnvq gsdvDKYYFT QRGLEQAGvT 

msa253220 .2{157_A909} DKYYFT QRGLEQAGvT 

Consensus ********** ********** ******* ****** ********_* 



msa253220 
msa25322 
msa253 22 
msa253220 .2{ 

msa253220. 

msa253220 . 

msa253220 . 

msa253220 . 
msa253220.2{l57. 

msa253220 . 

msa253220 . 



.2{157__090} 
0.2{157_CJB110} 
0.2{l57_1169NT} 
"l57_18RS2l} 
2(157_M732) 
2(157_M781} 
2{157_C0H1} 
2{l57_H36B} 
_JM9130013} 
2(157__2603} 
2{157_A909} 
Consensus 



51 

ILPFSPNNIS 
I LPFSPNNI S 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
ILPFSPNNIS 
********** 



EDLEI IAGNA 
EDLEIIAGNA 
EDLEI IAGNA 
EDLEI IAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
EDLEIIAGNA 
********** 



FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
FRPDNNEELA 
********** 



YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVIEKGYhFK 
YVI EKGYqFK 
YVIEKGYhFK 
*******_** 



100 

RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
RYHEFLGDFM 
********** 



msa253220 . 2 { 157_090 } 
msa253220.2{l57_CJB110} 
msa253220 . 2 f 157_1169NT) 
msa253220 . 2 { 157_18RS21 } 
msa253220 . 2 { 157_M732 * 
msa2 5 3 2 2 0 . 2 { 1 5 7_M7 8 1 ] 
msa2 53220.2fl57_COHl] 
msa2 53220 . 2 { 157_H36B] 
msa253220 . 2 { 157_JM9130013 
msa253220.2{l57_2603; 
msa253220.2{l57__A909} 
Consensus 



101 

RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 
RQFTSLGVAG 



AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 
AHGKTSTTGL 



LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 
LAHVLKNITD 



TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 
TSFLIGDGTG 



150 

RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 
RGSANANYFV 



********** ********** ********** ********** ********** 



msa253220.2{l57_090} 



151 . 200 

FEADEYERHF MPYHPEYSII TNI DFDHPD Y FTGLEDVFNA FNDYAKQVQK 
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Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD) 



msa253220 . 2 
msa2.53220 . 2 
msa253220 .2{ 

msa253220 . 

msa253220 . 

msa253220 . 

msa2 5322 0 - 
msa253220 .2(157 

msa253220 

msa253220 



157J2JB110) 
157_1169NT} 
157_18RS2l) 
2{157_M732} 
2{l57_M78lJ 
2{l57_COHl) 
2{157_H36B} 
_JM9130013> 
2{157_2603) 
2{l57_A909} 
Consensus 



FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
FEADEYERHF 
********** 



MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 
MPYHPEYSII 



TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 
TNIDFDHPDY 



FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 
FTGLEDVFNA 



FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 
FNDYAKQVQK 



********** ********** ********** ********** 



201 250 

msa253220.2{l57_090} GLFIYGEDsK LHEITSkAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220.2{l57_CJB110} GLFIYGEDsK LHEITSkAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220 . 2 { 157_1169NT) GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa2 53220 . 2 { 157_lBRS2l} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220.2{l57_M732} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220.2{l57_M78l} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220.2(l57_COHl) GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220-2{l57_H36B} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220 .2{157_JM9130013} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa253220.2{l57_2603} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

msa25322 0.2{l57_A909} GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN 

Consensus ********_* ******_*** ********** ********** ********** 



msa253220.2{157_090, 
msa253220 . 2 { 157_CJB110 J 
msa253220 . 2 { 157_1169NT) 
msa253220.2{l57_18RS2l] 
msa2 53220 . 2 { 157_M732 j 
msa253220 . 2 { 157_M78 1 j 
msa253220.2{l57_COHl] 
msa2 53220-2(15 7_H3 SB } 
msa253220 . 2 (l57_JM9130013 ] 
msa253220.2{l57_2603; 
msa253220 - 2 { 157_A909} 
Consensus 



251 

QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 
QEEIGQFHVP 



AYGKHNI LNA 
AYGKHN ILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 
AYGKHNILNA 



********** ********** 



TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
TAVIANLYIM 
********** 



GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
GIDMALVAEH 
********** 



300 

LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR- 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
LKTFSGVKRR 
********** 



msa253220 
msa253220.2{ 
msa253220 
msa253220 
msa253220 
msa253220 
msa253220 . 
msa2 5322 0. 
msa253220.2{!57 
msa253220 
msa253220 



2{157__090} 
... jlSV^CJBllOi 
.2{157_1169NT} 
.2{157_18RS21} 
2{157_M732} 
2{157_M781) 
2{157_C0H1) 
2{157_H36B} 
_JM9130013} 
2{157_2603} 
2{157_A909) 
Consensus 



301 

FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
FTEKIIDDTV 
********** 



I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
I IDDFAHHPT 
********** 



EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
EI IATLDAAR 
********** 



QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 
QKYPSKEIVA 



350 

I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 
I FQPHTFTRT 



********** ********** 



msa253220 
msa253220 .2 
msa253220 . 2 
msa253220 .2 
msa253220 
msa253220 . 
msa253220 
msa253220 . 
msa253220.2{!57 
msa253220 . 
msa253220 . 



.2{l57_090} 
157_CJB110} 
157_1169NT} 
157_18RS21} 
2{157_M732) 
2{157_M781} 
2{157_CGH1} 
2{157_H36B} 
_JM9130013} 
2{157_2603} 
2{157_A909} 
Consensus 



351 

lALLDdFAHA 
IALLDdFAHA 
IALLDeFAHA 
I ALLDe FAHA 
IALLDeFAHA 
IALLDeFAHA 
IALLDeFAHA 
IALLDeFAHA 
IALLDeFAHA 
IALLDeFAHA 
IALLDeFAHA 
*****_**** 



LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 
LSQADSVYLA 



QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 
QIYGSAREVD 



NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 
NGEVKVEDLA 



********** ********** ********** 



400 

AKI VKHSDLV 
AKIVKHSDLV 
AKI VKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
AKIVKHSDLV 
********** 



msa253220 
msa253220 .2{ 
msa253220 .2 { 
msa253220 .2 { 

msa253220 . 

msa253220 . 

msa253220 . 

msa253220 . 
msa2 53220.2(157 

msa253220 . 

tnsa253220 



2 {157_090' 
157__CJB110j 
157__1169NT] 
157_18RS2l] 
2{l57__M732j 
2{157_M781 1 
2{l57J20Hl] 
2{157_H36B] 
_JM9130013' 
2{157_2603' 
2{157_A909] 
Consensus 



401 

TVENVS PLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
TVENVSPLLN 
********** 



HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
HDNAVYVFMG 
********** 



AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
AGDIQLYERS 
********** 



FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
FEELLANLTK 
********** 



443 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 
NTQ 



808 
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Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD) 



SEQ ID NO. 4701 
STRAIN A909 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAJ^^ 
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA 
AAGATAAAG CAAGTG AATATT CAAATTTAG CTGTTGATAGTTTTAAAGAT 
TATAAAGGTAAATTTGAAT CAGGTGAATTGACAACAGAGGATAT CGT CT C 
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG 
T CAAT CAAG CTAAAT CAAAATT CTCAGACGAGGAT ACTG CTAAAAAAGAA 
GATAAGG CT C CTGAAACAAAAGTAG AAGATATTGTCATT G ATTATAAAG A 
AAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4702 
STRAIN H3 6B 

TATTTTTTAACAACAAAAAAAGGAAAAGAG CTAAGGAAAAATG CAGAAAA 
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA 
AAGATAAAGC^GTGAATATTCAAATTTAGGrGTTGATACTTTTAAAGAT 
TATAAAGGTAAATTTGAATCAGGTG AATTGACAACAGAGGATAT CGT CT C 
AG C CGTTAAGGAAAAAAG CGGAGAAGTAGTTG ACTTTG C TAATGATTTTG 
TCAATCAAGCT AAAT CAAAATT CT CAGACGAGGAT ACTG CTAAAAAAGAA 
GATAAGG CT C CTGAAACAAAAGTAGAAGATATTGT CATT GATTATAAAG A 
AAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4703 
STRAIN 18RS21 

TATTTTTTAACAACAAAAAAAGGAAAAGAG CTAAGGAAAAATG CAGAAAA 

ATTCTATGGAGAATATAAAGAAAAT CCAGAAGAATATCATCAAATAGCTA 

AAGATAAAG CAAGTGAATATTCAAATTT AG CTGTTGATACTTT^ 

TATAAAGGTAAATTTGAAT CAGGTGAATTGACAACAGAGGATAT CGT CTC 

AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACITTGCTAATGATT^ 

TCAATCAAGCTAAATCAAAATTCTCAGACC^GGATACTO 

GATAAGG CT C CTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA 

AAACACAGAAGATAAAGAAAAA. 

SEQ ID NO. 4704 
STRAIN M732 

TATTTTTTAACAACAAAAAAAGGAAAAGAG CTAAGGAAAAATG CAGAAAA 
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA 
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTA^ 
TATAAAGGTAAATTTGAAT CAGGTGAATTGACAACAGAGGATAT CGT CTC 
AGCCGTTAAGGAAAAAAG CGGAGAAGTAGTTGACTTTG CTAATGATTTTG 
T CAAT CAAG CTAAATCAAAATT CTCAGACGAGGATACTG CTAAAAAAGAA 
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA 
AAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4705 
STRAIN COH1 

TATTTTTTAACAACAAAAAAAGGAAAAGAG CTAAGGAAAAATG CAGAAAA 
ATTCTATGGAGAATATAAAGAAAAT CCAGAAGAATATCAT CAAATAG CTA 
AAGATAAAG CAAGTGAATATTCAAATTTAG CTGTTGATACTTTTAAAG^ 
TATAAAGGTAAATTTGAAT CAGGTGAATTGACAACAGAGGATATCGT CTC 
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG 
TCAATCAAG CTAAAT CAAAATT CT CAGACGAGGATACTG CTAAAAAAGAA 
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTGATTGATTATAAAGA 
AAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4706 
STRAIN M781 

TATTTTTTAACAACAAAAAAAGGAAAAGAGC 

TAAGGAAAAATGCAGAAAAATTCTATGGAGAATATAAAGAAAATCCAGAA 
GAATATCAT CAAAT AGCTAAAGATAAAG CAAGTGAATATT CAAATTTAG C 
TGTTGATACTTTTAAAGATTATAAAGGTAAATTTGAATCAGGTGAATTGA 
CAACAGAGGATATCGTCTCAGCCGTTAAGGAAAAAAGCGGAGAAGTAGTT 
GACITTGCTAATGATTTTGTCAATCAAG CTAAATCAAAATTCT CAGACGA 
GGATACTG CTAAAAAAG AAGATAAGG CT CCTGAAAGAAAAGTAGAAGATA 
TTGTCATTGATTATAAAGAAAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4707 
STRAIN 2603 

tattttttaacaacaaaaaaaggaaaagagctaaggaaaaatgcagaaaa 
attctatggagaatataaagaaaatccagaagaatatcatcaaatagcta 
aagataaagcaagtgaatattcaaatttagctgttgatacttttaaagat 
tataaaggtaaatttgaatcaggtgaattgacaacagaggatatcgtctc 
agccgttaaggaaaaaagcggagaagtagttgactttgctaatgattttg 
tcaatcaagctaaatcaaaattctcagacgaggatactgctaaaaaagaa 
gataaggctcctgaaacaaaagtagaagatattgtcattgattataaaga 
aaacacagaagataaagaaaaa 

SEQ ID NO. 4708 
STRAIN 090 

TAlT*lTTT'aACaACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA 
ATTCTATGGAGAATATAAAGAAAAT CCAGAAGAATATCAT CAAATAG CTA 
AAGATAAAG CAAGTG AATATTCAAATTTAG CTGTTGATACTTTT AAAGAT 



809 



WO 2004/018646 



PCT/US2003/026827 



Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD) 



TATAAAGGT AAATTTGAAT CAG GTGAATTGACAACAGAGG AT AT CGT CT C 
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG 
TCAAT CAAG CTAAAT CAAAATT CT CAGAC GAGGAT ACTG CTAAAAAAG A a 
GATAAGG CT C CTGAAACAAAaGTAGAAGAT ATTGT CATTG ATTATAAAGA 
AAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4709 
STRAIN CJB110 

TATTTTTTAACAACAAAAAAAG GAAAAGAG CT AAG GAAAA 
ATGCAGAAAAA.TTCT ATGGAGAATATAAAGAAAAT C CAGAAG AATAT CAT 
CAAATAGCTAAAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATAC 
TTTTAAAGATTATAAAGGT AAATTTGAAT CAGGTgAATTG ACAACAG AG G 
ATATCGTCTCAGCCGtTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCT 
AATG ATTTTGT CAAT CAAG CT AAATCAAAATT CT CAGACGAGGAT ACTGC 
TAAAAAAG AAGATAAGG CTC CTGAAA.CAAAAGTAGAAGATAT TGTCA.TTG 
ATTATAAAGAAAAC^CAGAAGATAAAGAAAAA 

SEQ ID NO. 4710 
STRAIN 1169NT 

TATTTTTTAACAACAAAAAAAGGAAAAa^GCTAAGGAAA 
AATG CAGAAAAATT CT ATGG AG AATAT AAAGAAAATCCAGAAGAAT AT CA 
T CAAATAGCTAAAGATAAAG CAAGTGAATATT CAAATTTAGCTGTTGATA 
CTTTTAAAGATT AT AAAGGT AAATTTGAAT CAGGTGAATTGACAACAGAG 
GATATCGTCT CAG C CGTTAAGGAAAAAAG CGG AGAAGTAGTTGACTTTG C 
TAATG ATTTTGT CAATCAAG CT AAATCAAAATTCTCAGATGAGGATACTG 
CTAAAAAAG AAAATAAGG CT CCTGAAACAAAAGTAGAAG ATATTGT CATT 
GATTATAAAGAAAACACAGAAGATAAAGAAAAA 

SEQ ID NO. 4711 
STRAIN JK9130013 

TATTTTTTAa CAACAAAAAAAGGAAAA.GAG CTAAGGAAAA 

AT G CAGAAAAATT CTATGG AGAATATAAAGAAAAT C CAGAAG AATATCAT 

CAAATAG CT AAAGATAAAG CAAGTG AATATT CAAATTTAG CTGTTGATAC 

TTTTAAAGATTATAA^GGTAAATTTGAATCAGGTGAATTGACAACAGAGG 

ATATCGTCTCAGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCT 

AATGATTTTGTCAAT CAAG CTAAATCAAAATTCT CAGACGAGGATACTG C 

TAAAAAAGAAGATAAGGCT CCTGAAACAAAAGTAGAAGAT ATTGT CATTG 

ATTATAAAGAAAACACAGAAGATAAAGAAAAA 

PRETTY of: /biotmp/msa68511 . 2 { *} January 22, 2003 05:47 

1 50 

msa68511 . 2{164_090} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 .2 {l64_18RS2ll TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 . 2{l64_2603 } TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

maa68511.2{l64_A909} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 . 2 { 164__CJB110} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

ms a68 511 .2(16 4_COHl} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511. 2{164__H3 6B} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 . 2 { 164_JM9130013 } TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 .2{164_M732} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa6S511 . 2{l64_M78l} TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

msa68511 . 2 { 164_1169NT) TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA 

Consensus ********** ********** ********** ********** ********** 



msa68511 
msa68511.2{ 

msa68511. 

msa68511. 
msa68511.2{ 

msa68511 

msa68511. 
msa68 511.2(164 

msa68511. 

msa68511. 
msa68511.2{ 



2{164_090} 
164__18RS2l} 
2{l64_2603> 
2{164_A909} 
164_OTB110} 
2{l64_COHl} 
2{164_H36B} 
_JM9130013} 
2{l64_M732} 
2{l64_M78l} 
164_1169NT} 
Consensus 



51 

ATT CTATGGA 
ATT CTATGGA 
ATT CTATGGA 
ATTCTATGGA 
ATTCTATGGA 
ATT CTATGGA 
ATTCTATGGA 
ATTCTATGGA 
ATTCTATGGA 
ATTCTATGGA 
ATTCTATGGA 
********** 



GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
GAATATAAAG 
********** 



AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
AAAATCCAGA 
********** 



AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
AGAATATCAT 
********** 



100 

CAAATAGCTA 
CAAATAG CT A 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
CAAATAGCTA 
********** 



msa68511 
msa68511.2{ 

msa68511. 

msa68511. 
msa68511.2{ 

msa€8511 

msa68511 
msa68511.2{l64 

msa68511. 

msa68511 
msa685ll.2{ 



.2 { 164JD90) 
164_18RS2l} 
2 { 164_2603} 
2{164_A909} 
164_CJB110) 
2{164_C0H1) 
2{164_H36B} 
_JM9130013} 
2{164_M732) 
2{164_M78l) 
164_1169NT) 
Consensus 



101 

AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
********** 



AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
AAGTGAATAT 
********** 



T CAAATTTAG 
T CAAATTTAG 
T CAAATTTAG 
TCAAATTTAG 
T CAAATTTAG 
T CAAATTTAG 
TCAAATTTAG 
TCAAATTTAG 
TCAAATTTAG 
TCAAATTTAG 
TCAAATTTAG 
********** 



CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
CTGTTGATAC 
********** 



150 

TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
TTTTAAAGAT 
********** 



810 
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Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD) 



msa68511 
tnsa68511 . 2{ 

msa68511 . 

msa68511 . 
msa68511.2{ 

msa68511 

msa68511 
msa68511.2{l64 

msa68511. 

m9a68511. 
msa68511.2{ 



2{l64_090} 
164_18RS2l} 
2{l64_2603} 
2{164_A909} 
164_CJB110) 
2{l64_COHl} 
2{164JK36B) 
_JM9130013) 
2{164 M732) 
2{lS4~M78l} 
164_1169NT} 
Consensus 



151 

TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
********** 



AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
AATTTGAATC 
********** 



AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
AGGTGAATTG 
********** 



ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
ACAACAGAGG 
********** 



200 

ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
********** 



msa68511 . 2 { 164_090 } 
msa68511.2{l64_18RS2l} 
maa68511 . 2 { 164J2603 } 
msa68511 . 2 { 164_A909} 
tnsa68511 . 2 { 164_CJB110' 
msa68511 . 2 { 164_COHl . 
msa68511 . 2 { 164_H36B] 
msa68511 . 2{l64_JM9130013 ' 
msa68511.2{l64_M732, 
msaS8 511 . 2 { 164_M7 81 } 
. msa68511.2{l64_1169NT} 
Consensus 



201 

AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
AGCCGTTAAG 
********** 



GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
GAAAAAAGCG 
********** 



GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
GAGAAGTAGT 
********** 



TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
TGACTTTGCT 
********** 



250 

AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
AATGATTTTG 
********** 



251 

msa68511.2{l64_090} TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_18RS2l} TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_2603} TCAATCAAGC TAAATCAAAA 

msa68511 .2{164_A909} TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_CJB110} TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_COHl} TCAATCAAGC TAAATCAAAA 

msa68S11.2{l64_H36B> TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_JM9130013) TCAATCAAGC TAAATCAAAA 

msa68511.2{l64_M732} TCAATCAAGC TAAATCAAAA 

msa68511 - 2 {164_M7Bl} TCAATCAAGC TAAATCAAAA 

msa68511. 2 { 164^116 9NT} TCAATCAAGC TAAATCAAAA 

Consensus ********** ********** 



TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGACG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAcG 
TTCTCAGAtG 
********_ 



AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 
AGGATACTGC 



300 

TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 
TAAAAAAGAA 



********** ********** 



301 

msa68511.2{l64_090} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64__18RS2l) gATAAGGCTC CTGAAACAAA 

msa68511 .2{164_2603> gATAAGGCTC CTGAAACAAA 

msa68S11.2{l64_A909) gATAAGGCTC CTGAAACAAA 

msa685ll.2{l64_CJB110} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64__COHl} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64_H36B} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64_JM913 0013} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64_M732} gATAAGGCTC CTGAAACAAA 

msa68511.2{l64_M78l} gATAAGGCTC CTGAAACAAA 

Tnsa68511.2{l64_1169NT} aATAAGGCTC CTGAAACAAA 

Consensus _********* ********** 



AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 
AGTAGAAGAT 



ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 
ATTGTCATTG 



350 

ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 



********** ********** ********** 



msa68511 . 2 { 164_090 } 
msa68511 . 2 { 164_18RS2lJ 
msa68S11.2{l64_2603} 
msa68511 - 2 { 164_A909 J 
msa685ll . 2 { 164_CJB110 ) 
msa68 511 . 2 ( 164_C0H1 > 
msa68511 . 2 { 164_H36B} 
msa68511.2{l64_JM913 0013} 
msa68511.2fl64_M732) 
msa68511.2{164_M781} 
msa68511.2{l64_1169NT} 
Consensus 



351 372 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
AAACACAGAA GATAAAGAAA AA 
********** ********** ** 



SEQ ID NO. 4712 
STRAIN 2603 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKAS 

TTED I VSAVKEKSGE WDFANDFVNQAKS KFSDEDTAKKEDKAPETKVED IVIDYKENTE 
DKEK 

SEQ ID NO. 4713 
STRAIN A909 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNIAVDTFKDYKGKFESGEL 
TTED I VSAVKEKSGE WDFANDFVNQAKS KFSDEDTAKKEDKAPETKVED IVIDYKENTE 
DKEK 



SEQ ID NO. 4714 



811 



WO 2004/018646 
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Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD) 



STRAIN H36B frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNIAVDTFKDYKGKFESGEL 
TTEDI VSAVKE KSGE VVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVED IV I DYKENTE 
DKEK 

SEQ ID NO. 4715 
STRAIN 18RS21 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL 
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE 
DKEK 

SEQ ID NO. 4716 
STRAIN M732 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL 
TTEDI VSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETK^DIVIDYKENTE 
DKEK 

SEQ ID NO. 4717 
STRAIN _COHl frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL 
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDXAPETKX^EDIVIDYKENTE 
DKEK 

SEQ ID NO. 4718 
STRAIN _M7 81 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNIiAVDTFKDYKGKFESGEL 
TTEDIVSAVKEKSGEVVDFANDFWQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE 
DKEK 

SEQ ID NO. 4719 
STRAIN _090 frames 1 

TTEDI VSAVKE KS GEWD FAND FVNQAKS KFS DEDTAKKEDKAPETKVE D I V I DYKENTE 
DKEK 

SEQ ID NO. 4720 

STRAIN _CJB110 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNIAVDTFKDYKGKFESGEL 
TTED I VSAVKEKSGE WDFAND FVNQAKSKFSDEDTAKKEDKAPETKVED I VI DYKENTE 
DKEK 

SEQ ID NO. 4721 
STRAIN 116 9NT frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQ 

TTED I VSAVKEKSGE WDFAND FVNQAKS KFSDEDTAKKENKAPETKVED I VI DYKENTE 
DKEK 

SEQ ID NO. 4722 

STRAIN _JM9130013 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL 
TTEDIVSAVKEKSGEVVDFANDFWC^KSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE 
DKEK 

PRETTY of: /biotmp/rasa68746.2{*} January 22, 2003 05:54 

1 50 

msa68746.2{l64 090} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746 . 2 { 164_1169NT} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64_18RS2l} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{!64_2603} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64_A909) YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64_CJB110) YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746 . 2 { 164_COHl } YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746 . 2 { 164_H36B} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64_JM9130013} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64_M732} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

msa68746.2{l64__M78l} YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD 

Consensus ********** ********** ********** ********** ********** 

51 100 

msa68746. 2 {164_090} YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746.2ll64_1169NT} YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746.2{l64_18RS2l) YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746 .2 { 164_2603 } YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746.2{l64_A909} YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746.2{l64__CJB110} YKGKFESGEL TTEDIVSAVK EKSGEWDFA ND FVNQAKS K FSDEDTAKKE 

msa68746.2fl64_COHl} YKGKFESGEL TTEDIVSAVK EKSGEWDFA NDFVNQAKSK FSDEDTAKKE 

msa68746 .2 {164_H36B} YKGKFESGEL TTEDIVSAVK EKSGEWDFA NDFVNQAKSK FSDEDTAKKE 

msa68746.2{l64_JM913 0013} YKGKFESGEL TTEDIVSAVK EKSGEWDFA NDFVNQAKSK FSDEDTAKKE 

msa68746.2 (164_M732) YKGKFESGEL TTEDIVSAVK EKSGEWDFA NDFVNQAKSK FSDEDTAKKE 

msa68746 .2{164_M781) YKGKFESGEL TTEDIVSAVK EKSGEWDFA NDFVNQAKSK FSDEDTAKKE 



812 
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Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD) 



Consensus 



********** ********** ********** ********** ********** 



msa68746.2{l64_090} 
msa68746.2(l64_1169NT} 
msa68746 . 2 { 164_18RS2l} 
msa68746 . 2 { 164J2603 } 
msa68746.2{l64__A909} 
msa68746.2{l64_CJB110) 
msa6 8746 . 2 { 164_COHl } 
tnsa68746.2{l64_H36B} 
rasaG 8 74 6 . 2 { 164_JM9 13 0 0 13 } 
msa68746 . 2 { 164_M732 } 
msa68746 . 2 {164_M78l} 
Consensus 



101 

dKAPETKVED 
nKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 
dKAPETKVED 



IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 
IVIDYKENTE 



********** ********** 
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DKEK 
DKEK 
DKEK 
DKEK 
DKEK 
DKEK • 
DKEK 
DKEK 
DKEK 
DKEK 
DKEK 
**** 
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SEQ ID NO: 4801 
STRAIN 2603 

aatagtactgagacaagtgctt cagt agt t cc tac tacaaat actat cgt 
tcaaactaatgacagtaafccctaccgcaaaatttgtatcagaatcaggac 
aatctgtaataggtcaagtaaaaccagataattctgcggcgcttacaaca 
gttgacacgcctcatcatatttcagctccagafcgctttaaaaacaactca 
atcaagtcctgtcgttgagagtacttctactaagttaactgaagagactt 
acaaacaaaaagatggtcaagatttagccaacatggtgagaagtggtcaa 
gttactagtgaggaactcgttaatatggcatacgatattattgctaaaga 
aaacccatctttaaatgcagtcattactactagacgccaagaagctattg 
aagaggc t agaaaac 1 1 aaagat ac caa t cagccg 1 1 1 1 1 aggtgt t ccc 
ttgttagtcaaggggttagggcacagtattaaaggtggtgaaaccaataa 
tggcttgatctatgcagatggaaaaattagcacatttgacagtagctatg 
tcaaaaaatataaagatttaggatttattattttaggacaaacgaacttt 
ccagagtatgggtggcgtaatataacagattctaaattatacggtctaac 
gcataatccttgggatcttgctcataatgctggtggctcttctggtggaa 
gtgcagcagccattgctagcggaatgacgccaattgctagcggtagtgat 
gctggtggttctatccgtattccatcttct tggacgggc 1 1 gg t agg 1 1 1 
aaaaccaacaagaggattggtgagtaatgaaaagccagattcgtatagta 
cagcagttcattttccattaactaagtcatctagagacgcagaaacatta 
ttaacttatctaaagaaaagcgatcaaacgctagtatcagttaatgattt 
aaaatctttaccaattgcttatactttgaaatcaccaatgggaacagaag 
ttagtcaagatgctaaaaacgctattatggacaacgtcacattcttaaga 
aaacaaggattcaaagtaacagagatagacttaccaattgatggtagagc 
attaatgcgtgattattcaaccttggctattggcatgggaggagcfctttt 
caacaattgaaaaagacttaaaaaaacatggttttactaaagaagacgtt 
gatcctattacttgggcagttcatgttatttatcaaaattcagataaggc 
tgaacttaagaaatctattatggaagcccaaaaacatatggatgattatc 
gtaaggcaatggagaagcttcacaagcaatttcctattttcttatcgcca 
acgaccgcaagtttagcccctctaaatacagatccatafcgtaacagagga 
agataaaagagcgatttataatatggaaaacttgagccaagaagaaagaa 
ttgctctctttaatcgccagtgggagcctatgttgcgtagaacacctttt 
acacaaattgctaatatgacaggactcccagctatcagtatcccgactta 
cttatctgagtctggtttacccatagggacgatgttaatggcaggtgcaa 
actatgatatggtattaattaaatttgcaactttctttgaaaaacatcat 
ggttttaatgttaaatggcaaagaataatagataaagaagtgaaaccatc 
tactggcctaatacagcctactaactccctctttaaagctcattcatcat 
tagtaaatttagaagaaaattcacaagttactcaagtatctatctctaaa 
aaatggatgaaatcgtctgttaaaaataaaccatccgtaatggcatatca 
aaaagca 

SEQ ID NO: 4802 
STRAIN 090 

* AATAGTACTGAGACAAGTG CTT CAGTAGTT CCTACTACAA 

ATACTAT CGTT CAAACT AATGACAGTAAT C CT AC CG CAAAATTTGTAT CA 
GAATGAGGACAATCTGTAAT AGGTCAAGTAAAAC CAGAT AATT CTG CGG C 
GCTTACAACAGTTGACACG C CT CATCATATTTCAG CT CCAGATG CTTTAA 
AAACAACTCAATCAAGT CCTGT CGTTGAGAGTACTTCTACTAAGTTAACT 
GAAGAGACITACAAACAAAAAGATGGTAAAGATTTAGCCAACATGGTGAG 
AAGTGGTCAAGTTACTAGTGAGGAACrCGTTAATATGGCATACGATATTA 
TTGCTAAAGAAAACC CATCTTT AAATG CAGT CATTACTACTAGACG C CAA 
GAAG CTATTGAAGAGG CTAGAAAACTTAAAGATAC CAAT CAG C CGTTTTT 
AGGTGTT C CCTTGTTAGTCAAGGGGTTAGGG CACAGTATTAAAGGTGGT G 
AAAC CAATAATGGCTTGAT CTATG CAGATGGAAAAATTAG CACATTTGAC 
AGTAG CTATGTCAAAAAATATAAAGATTTAGGATTTATT ATTTTAGGACA 
AACGAACTTT CCAGAGTATGGGTGG CGTAATATAACAGATTCTAAATTAT 
ACGGTCTAACGCATAATCCTTGGGATCTTG CT CATAATG CTGGTGGCTCT 
TCTGGTGGAAGTGCAGCAGCCATTGCTAGCGGAATGACGCCAATTGCTAG 
CGGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCTTGGACGGGCT 
TGGTAGGTTTAAAAC CAACAAGAGGATTGGTGAGTAATGAAAAG C CAGAT 
TCGTATAGTACAGCAGTT CATTTTC CATTAACTAAGT CAT CT AG AGACG C 
AGAAACATT ATTAACTT AT CTAAAGAAAAG CG AT CAAACGCT AGTAT CAG 
TTAATGATTTAAAAT CTT t AC CAATTGCITATACTTTGAAATCAC CAATG 
GGAACAG AAGTTAGT CAAG ATG CTAAAAACG CTATTATGGACAACGT CAC 
ATTCTTAAGAAAACAAGGATTCAAAGTAACAGAGATAGACrTACCA 
ATGGTAGAG CATTAATGCGTGATTATT CAACCTTGG CTATTGG CATGGG A 
GGAGCITTTTCAACAATTGAAAAAGACITA 

■ AGAAGACGTTGATCOTATTACTTGGG CAGTTCATGTTATTTAT CAAAATT 
CAGATAAGG CTGAACTTAAGAAAT OTATTATGGAAGCCCAAAA^CAT ATG 
GATGATTATCGT AAGGCAATGGAG AAG CTT CACAAGCA^TTTCCTATTTT 
CTTATCG CCAACGAC CG CAAGTTTAGC CC CTCTAAATACAGATC CATATG 
TAACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAA 
GAAGAAAGAATTGCTCTCTTTAATCGCCAGTGGGAGCCTATGTTGCGTAG 
AACAC(T[TTTACACAAATTGCTAATATGACAGGACTCCCAGCT 
TC C CGACTTACTTAT CTGAGTCTGGTTTAC C CATAGGGACGATGTTAATG 
G CAGGTG CAAACTATG ATATGGTATTAATTAAATTTG CAACTTTCTTTG A 
AAAACAT CATGGTTTTAATGTT AAATGG CAAAGAATAAT AGATAAAG AAG 
TGAAACCATCTACTGGCCTAATACAGCCTACTAACTCCCrCTTTAAAGCT 
CATTCAT CATTAGTAAATTT AGAAGAAAATTCACAAGTTACT CAAGT AT C 
TATCT CT AAAAAATGGATGAAATCGT CTGTTAAAAATAAACCATC CGTAA 
TGGCATATCAAAAAGCA 

SEQ ID NO: 4803 
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STRAIN A909 

TACTACAAATACTATCGTTCAAACTAATGAC^GTAATCCTACCGCAAAAT 
TTGT ATCAG AAT CAGGACAAT CTGTAATAGGT CAAGT AAAAC CAG ATAAT 
TCTGCGGCGCTTACAACAGTTGACACGCCTCATCATATTTCAGCTCCAGA 
TGCTTTAAAAACAACTCAATCAAGTCCTGTCGTTGAGAGTACTTCTACTA 
AGTTAACTGAAGAGACTTACAAACAAAAAGATGGTCAAGATTTAGCCAAC 
ATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACT CGTT AAT ATGG CATA 
CGATATTATTGCTAAAGAAAACCCATCTTTAAATGCAGTCATTACTACTA 
GACGCCMGAAGCTATTGAAGAGGCTAGAAAACTTAAAGATACCAATCAG 
CCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGCACAGTATTAA 
AGGTGGTGAAACCAATAATGGCTTGATCTATGCAGATGGAAAAATTAGCA 
CATTTGACAGTAG CTATGT CAAAAAAT ATAAAGATTT AGGATTTATT AT T 
TTAGGACAAACGAACTTTCCAGAGTATGGGTGGCGTAATATAACAGATTC 
TAAATTATACGGTCTAACGCATAATCCTTGGGATCTTGCTCATAATGCTG 
GTGGCTCTT CTGGTGGAAGTGCAG CAGCCATTGCTAGCGGAATGACG CCA 
ATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCTTG 
GACGGGCrrTGGTAGGTTTAAAACCAACAAGAGGATTGGTGAGTAATGAAA 
AG CCAGATTCGTATAGTACAGCAGTTGATTTT CCATTAAc TAAGT CAT CT 
AGAGACG CAGAAACATTATT AACTTAT CTAAAGAAAAGC GAT CAAACG CT 
AGTAT C^GTTAATGATTTAAAATCTTT AC CAATTG CTTATACTTTGAAAT 
CACCAATGGGAACAGAAGTTAGTCAAGATG CT AAAAACG CTATT ATGGAC 
AACGT C^C aTTCTT AAGAAAACAAGGATT CAAAGTAACAG AGAT AGACTT 
AC CAATTGATGGTAGAGCATTAATG CGTGATTATTCAACCTTGG CTATTG 
GCATGGGAGGAG CTTTTTCAACAATTGAAAAAGACT^ 
TTTACTAAAGAAGACGTTGATCCTATTACTTGGG CAGTT CATGTTATTTA 
T CAAAATT CAGATAAGG CTG AACTT AAGAAAT CTATT AT GGAAG C C CAAA 
AACATATGGATGATTAT CGTAAGG CAATGGAGAAG CTT CACAAGCAATTT 
CCTATTTTCITATCGC G^CGACCG CAAGTTTAGC CC CT CTAAATACAGA 
TCCATATGTaACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACT 
TGAG C CAAGAAGAAAGAATTGCTCT CTTTAATCG C CAGTGGGAG C CTATG 
TTGCXSTAGAACACCTTTTACACAAATTGCTA 

TATCAGTAT C CCGACTTACTTATCTGAGTCTGGTTTACCCATAGGGACGA 
TGTTAATGGCAGGTGCAAACTATGATATGGTATTAATTAAATTT 
TTCTTTGAAAAACATC^TGGTTTTAATGTTAAATGGCAAAGAATAATAGA 
TAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACrAACTCCCTCT 
TTAAAGCTCATT CAT CATTAGTAAATTTAGAAGAAAATT CACAAGTTACT 

ATCCGTAATGGCATATCAAAAAGCA 

SEQ ID NO: 4804 
STRAIN COH1 

AATAGTACTGAGACAAGTGCTTCAGTAGCTCCTACTACAAAT 
ACTAT CGTT CAAACTAATGACAGTAAT CCTAC CG CAAAATTTGCATCAGA 
AT CAGGACAATCTGTAATAGGT CAAGTAAAAC CAGCTAATT CTG CGG CG C 
TTACAACAGTTGACACG C CT CATATTT CAG CT CCAGATG CTTTAAAAACA 
ACTCAATCAAGT C CTGTCX^TTGAGAGT CCTT CTACTAAGTTAACTGAAGA 
GACATACAAACAAAAAGATGGTCAAGATTTAGCCAACATGGTGAGAAGTG 
GT CAAGTTACTAGTGAGGAACTCGT CAATATGGCATACGATATT ATCGCT 
AAAGAAAACC CATCTTTAAATG CAGT CATTACTACTAGACGC CAAGAAG C 
C7VITGAAGAGGCTAGAAAACTTAAAGATACTAAT CAG C CGTTTTTAGGTG 
TT CC CTTGTT AGT CAAGGGGTTAGGG CACAGTATTAAAGGTGGTGAAAC C 
AATAATGGCTTG AT CTATG CAGATGGAAAAATTAG CACATTTGACAGTAG 
CTATGTCAAAAAATATAAAGATTTAGGATTTATTATTTTAGGA 
ATTTTCCAGAGTATGGGTGGCXjTAATATAACAGACrCTAAATTATACGGT 
CCAACGCATAATCCTTGGAATCTTGCTCATAACGCTGGTGGCTCTTCT 
TGGAAGTGCAG CAG CTATTG CTAGCGGAATGACGC CAATTG CTAGCGGCA 
GTGATGCTGGTGGTT CTAT CCGTATTCCATCTTGTTGGACGGGCTTAGTA 
GGTTTAAAACCAACAAGAGGATTGGTGAGTAATGAAAAG CCAGATTCGTA 
TAGT ACAGCAGTTCATTTT C CATTAACTAAGT CATCTAG AGACG CAG AAA 
C^TTGTTAACITACCTAAAGAAAAGCGATCAAACGCTAGTATCAGTT 
GATTTAAAATCTTTAC CAATTG CTTATACTTTGAAAT CAC CAATG GG AAC 
AGAAGTTAGTCAAGATGCTAAAAATGCTATTATGGACAACGTCAGATTCT 
TAAGAAAACAAGGATT CAAAGTGACAGAGATAGATT t ACCAATTGATGGT 
AGAG CATTAATG CGTGATTATT CAAC CTTGG CTATTGG GATGGGAGGAG C 
TTTTTCAACAATTGAAAAAGACTTAAAAAAACATGGTTTTACTAAA 
ACGTTGATC C CATT ACTTGGGCAGTT CATGTTATTTAT CAAAATT CAGAT 
AAGGCTGAACTTAAG AAAT CTATTGTGGAAGCC CAAAAACATATGGATGA 
TTAT CGTAAGGCAATGGAG AAGCTT CACAAGCAATTTCCT CTTAT 
CG CCAACGACCG CAAglTTAG CCC CTCTAAATACAGATC CATATGTAACA 
GAGAAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAAGAAGA 
AAGAATTGCTCTCTTTAATCGCCAGTGGGAGCCTATGTTGCGTAGAACAC 
CTTTTACACCAATTGCTAATAtGACAGGACTCCCAGCTATCAGTATCCCG 
ACTTACTTAT CTGAGTCTGGTTTAC C CATAGGGACGATGTTAATGGCAGG 
TG CAAACTATGATATGGTATTAATTAAATTTG CAACITTCTTTGAAAAAC 
AT CATGGTTTTAATGTTAAATGG CAAAG AATAAT AGATAAAGAAGTGAAA 
CCAT CTG CT GAC CTAATACAG CCTACTAACTCCCT CTTTAAAGCTCATT C 
ATCATTAGTAAATTTAGAAG AAAATTCACAAGTT ACT CAAGTATCTATCT 
CTAAAAAATGGATGAAATCGTCTGTTAAAAATAAACCAT CCGTAATGGCA 
TATCAAAAAGCA 

SEQ ID NO: 4805 
STRAIN M732 

TCAGTAG CT CCT ACT ACAAATACTATCGTT CAAACTAATGACAGTAAT CC 



815 



WO 2004/018646 



PCT/US2003/026827 



Table 48: Comparative Sequences relating to SAG1474 



TACCGCAAAATTTGCATCAGAATCAGGACAATCTGTAATAGGTCAAGTAA 
AACCAGCTAATTCTGCGGCGCrTACAACAGTTGACACGCCTCATATTTCA 
GCTCCAGATGCTTTAAAAACAACTCAATCAAGTCCTGTCGTTGAGAGTCC 
TT CTACTAAGTT AACTG AAG AG ACATACAAACAAAAAGAT GGTCAAGATT 
TAGCGAA.CATGGTGAGAAGT GGTCAAGTTACTAGTGAGGAACTCGT CAAT 
ATGG CATACG AT ATT AT GG CTAAAGAAAACCCAT CTTTAAATG CAGTCAT 
TACTACTAGACG CCAAGAAG CCATTGAAGAGG CTAG AAAACTTAAAGATA 
CT AAT CAG C CGTTTTTAGGTGTTC C CTTGTTAGT CAAGGG GTTAGGG CAC 
AGTATTAAAGGTGGTGAAAC CAATAATGG CTT GATCT ATGCAGATGGAAA 
AATTAGCACATTTGACAGTAG CTATGT CAAAAAATATAAAGATTT AGGAT 
TTATT ATTT TAGGACAAACG AATTTTC CAG AGTATGGGTGG CGTAATATA 
ACAG ACT CT AAATTATACGGTCnAACG CATAATC CTTGGGAT CTTGCT CA 
TAACGCTC^TGGCTCTTCrGGTGGAAGTGCAGCAGCTATTGCrAGCGGAA 
TGACGCCAATTGCTAGCGGCAGTGATGCTGGTGGTTCTATCCGTATTCCA 
T CTT CTTGGACG GG CTTAGTAG GTTTAAAAC CAACAAGAGGATT GGTGAG 
TAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTAACTA 
AGT CAT CTAGAGACG CAGAAACATTGTTAACTTAC CTAAAGAAAAG CGAT 
CAAACGCTAGTATCAGTTAATGATTTAAAATCTTTACCAATTGCTTATAC 
TTTGAAATCAC CAATGGGAACAGAAGTTAGTCAAGATGCTAAAAATGCTA 
TTATGGACAACGTCACATT CTTAAGAAAACAAGGATT CAAAGTGACAGAG 
AT AG ATTTAC CAATTGATGGTAGAG CATTAATGCGTG ATTATTCAACCTT 
GGCTATTGG CATGGG AGGAG CTTT TTCAACAATTGAAAAAGACTTAAAAA 
AACATGGTTTTACTAAAGAAGACGTTGATCCCATTACTTGGG CAGTT CAT 
GTTATTTAT CAAAATT CAG ATAAGG CTGAACTTAAGAAAT CTATTGTGGA 
AGCCCAAAAACATATGGATGATTATOSTAAGGC^TGGAGAAGCTTCACA 
AG CA ATTTC CTATTTT CTTATCGCCAACG ACCGCAAGTTTAG CC C CT CTA 
AATACAGAT C CATATGTTACAGAGAAAGATAAAAGAG CGATTTATAATAT 
GGAAAACTTGAGC(2AAGAAGAAAGAATTGCTCTCTTTA^ 
AG CCTATGTTGCGTAGAACACCTTTTACAC CAATTG CTAATATGACAGGA 
CT C C CAGCTATCAGTAT C C CGACTTACTTAT CTGAGTCTGGTTTAC C CAT 
AGGGACGATGTTAATGG CAGGTGCAAACT ATGATATGGTATTAATTAAAT 
TTGCAACTTTCTTTGAAAAACATCATGGTTT^ 

ATAATAGATAAAGAAGTGAAACCATCTGCTGACCTAATAC^GCCrACTAA 
CT CC CT CTTTAAAG CTCATT CATCATTAGTAAATTTAGAAGAAAATTCAC 
AAGTTACTCAAGTATCTATCTCTAAAAAATGGATGAAATCGTCTGTTAAA 
AATAAACCAT CCGTAATGG CATAT CAAAAAG CA 

SEQ XD NO: 4806 
STRAIN 18RS21 

AATAGTACTGAGACAA.GTGOTTCAGTAGTTCCTACTACAAATACTATCGT 
TCAAACTAATGACAGTAAT C CTACCGCAAAATTTGTATCAGAAT CAGGAC 
AATCTGTAAT AGGT CAAGT AAAACCAGATAATTCTG CGG CGCTTACAACA 
GTTG ACACG C CT CAT CATATTT CAG CTCCAGATG CTTTAAAAACAACTCA 
ATCAAGTCCTGTCGTTGAGAGTACTTCTACTAAGTTAACTGAAGAGACTT 
ACAAACAAAAAG ATGGT CAAGATTTAG CCAACATGGTGAG AAGTGGTCAA 
GTTACTAGTGAGGAACTCGTTAATATGGCATACGATATTATTGCTAAAGA 
AAACC CATCTTTAAATG CAGTCATTACTACTAGACGC CAAGAAG CTATTG 
AAGAGGCTAGAAAACTTAAAGATAC CAAT CAG CCGTTTTTAGGTGTT CC C 
TTGTTAGTCAAGGGGTTAGGGCACAGTATTAAAGGTGGTGAAACCAATAA 
TGGCTTGATCTATG CAGATGGAAAAATTAG CACATTTGACAGTAGCTATG 
TCAAAAAATATAAAGATT/TAGGATTTATTATTTTA 

CCAGAGTATGGGTGGCGTAATATAACZVGATTCTAAATTATACGGTCTAAC 
G(^TAATCCTTGGGATCTTG CT CATAATGCTGGTGGCTCTTCTGGTGGAA 
GTGCAG CAG CCATTG CTAG CGGAATGACG C CAATTG CTAG CGGTAGTGAT 
G CTGGTGGTT CTAT CCGTATT CCAT CTTCTTGGACGGGCITGGTAGGTTT 
AAAAC CAACAAGAGGATTGGTGAGTAATGAAAAG CCAGATTCGTATAGTA 
CAGCAGTTCATTTTCCATTAACTAAGTCATCTAG 

TTAACTTAT CTAAAGAAAAGCGATCAAACG CTAGTATCAGTTAATGATTT 
AAAATCTTTACCAATTGCTTATACTTTGAAATCACC^ 
TTAGT CAAGATGCTAAAAAC G CTATTATGGACAACGTCACATT CTTAAGA 
AAACAAGGATTCAAAGTAACAGAGATAGACTTACCAATTGATGGTAGAGC 
ATTAATG CGTGATTATT CAAC CTTGG CTATTGGCATGGGAGG AG CTTTTT 
CAACAATTGAAAAAGACTTAAAAAAACATGGTTTTACTAAAGAAGACGTT 
GAT C CTATTACTTGGG CAGTT CATGTTATTTATCAAAAT T CAGATAAGG C 
TGAACTTAAGAAAT CTATTATGGAAGC CCAAAAACATATGGATGATT AT C 
GTAAGGCAATGGAGAAGCTT CACAAGCAATTTCCTATT^ 
ACGACCG CAAGTTT AG C CCCT CTAAATACAGATC CATATGTAACAGAGG A 
AG a t AAAAG AGCGATTTATAATATGGAAAACTTGAG C CAAGAAGAAAGAA 
TTGCT CTCTTTAAT CG C CAGTGGGAGC CTATGTTGCX3TAGAACAC CTTTT 
ACACAAATTG CTAATATGACAGGACTC CCAGCTATCAGTATCCCG ACTTA 
CTTAT CTGAGTCTGGTTTAC C CATAGGGACGATGTTAATGGCAGGTG CAA 
ACTATGAT ATGGTATTAATTAAATTTG CAACTIT CTT^ CAT 
GGTTTTAATGTTAAATGGCAAAGAATAATAGATAAAGAAGTGAAA.CCATC 
TACTGG CCT AATACAGCCTACTAACTC CCT CTTTAAAG CT CATTCAT CAT 
TAGT AAATTTAGAAG AAAATTCACAAGTTACTCAAGT ATCT 
AAATGGATGAAA.TCGT CTGTTAAAAAT AAACCAT C CGTAATGGCATATCA 
AAAAG CA 

SEQ XD NO: 4807 
STRAIN M781 

TG CTTCAGTAG CT CCTACTACAAATACTATCGTT CAAACTAATG ACAGTA 
ATCCTAC CG CAAAATTTG CATCAGAAT CAGGACAATCTGT AATAGGTCAA 
GTAAAAC CAG CTAATTCTG CGG CG CTT ACAACAGTTGACACG CCT CATAT 
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TTCAGCTCCAGATGCTTTAAAAACAACrCAATCAAGTCCTGTCGTTGAGA 
GTCCTTCTACTAAGTTAACTGAAGAGACATACAAACAAAAAGATGGTCAA 
GATTTAGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGT 
CAATAIGGCATACGATATTATCGCTAAAGAAAACCCATCTTTAAATGCAG 
TCATT ACTACT AG ACG C CAAGAAG C CATTGAAGAGG CTAG AAAACTTAAA 
GATACTAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGG 
GCACAGTATtAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGATG 
GAAAAATTAG CACATTTGACAGTAG CT ATGTCAAAAAAT ATAAAGATTTA 
GGATTTATTATTTTAGGACAAACGaATTTTCCAGAGTATGGGTGGCGTAA 
TATAACAGACTCTAAATTATACGGTCCAACGCATAATCCTTGGAaTCTTG 
CTCATAACGCTGGTGGCTCTTCTGGTGGAAGTGCAGCAGCTATTGCTAGC 
GGAATGACGCCAATTGCTAGCGGCAGTGATGCTGGTGGTTCTATCCGTAT 
TCCATCTTCTTGGACGGGC!TTAGTAGGTTTAAAACCAACAAGAGGATTGG 
TGAGTAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTA 
ACTAAGT CAT CTAG AGACG CAG AAACATTGTTAACTTAC CTAAAGAAAAG 
CG AT CAAACGCTAGTATCAGTTAATG^TTTAAAaTCTTTACCAATTGC^ 
ATACTTTGAAAT CAC CAATGGG AACAG AAgTTAGTCAAG ATG CTAAAAAT 
GCTATTATGG ACAACGT CACATT CTTAAGAGAACAAGGATTCAAAGTGAC 
AGAG ATAGATTTAC CAATTG ATGGTAG AG CATTAATG CGTGATTATTCAA 
CCTTGGCTATTGGCATGGGAGGAGCTTTTTCAACAATTGAA 
AAAAAACATGGTTTTACTAAAGAAGACGTTGATCCCATTACTTGGGCAGT 
TCATGTTATTTAT CAAAATT CAGATAAGG CTGAACTTAAGAAAT CTATTG 
TGGAAGCCCAAAAACATATGGATGATT AT CGTAAGG CAATGGAGAAG CTT 
CACAAGCAATTT C CTAT^ITT CITATCG CCAACGACCG CAAGTTTAG C CC C 
TCTAAAT ACAGAT C CATATGTAACAGaG aAAGATAAAAGAGCGATTTATA 
ATATGGAAAACTTG AGC CAAGAAGAAAGAATTG CTCT CiTT 
TGGGAGC CTATGTTG CGTAGAACAC CTTTT ACAC CAATTG CTAATAt GAC 
AGGACTCCCAGCTATCAGTATCCCGACTTACrTATCTGAGTCTG^ 
CCATAGGGACGATGTTAATGGCAGGTGCAAACTATGATATGGTATTAATT 
AAATTTG CAACTTT CTTTGAAAAACAT CATGGTTTTAATGTTAAATGGCA 
AAGAATAATAGATAAAGAAGTGAAAC CAT CTG CTG AC CTAATACAGCCTA 
CTAACTC CCT CT"1T'AAAGCT CATT CATCATTAGTAAATTT AGAAGAAAAT 
T CACAAGTTACTCAAGTAT CTATCT CTAAAAAATGGATGAAATCGTCTGT 
TAAAAATAAACCAT CCGTAATGG CATAT CAAAAAGCA 

SEQ ID MO: 4810 
STRAIN CJB110 

TAGTT CCTACTACAAAT ACTAT CX3TT CAAACTAATGACAGTAATCCTAC C 
GCAAAATTTGTATCAGAATCAGGACAATCrGTAATAGGTCAAGTAAAACC 
AGATAATTCTG CGGCGCTTACAACAGTTGACACG CCT CAT CATATTTCAG 
CTCCAGATG CTTTAAAAACAACTCAAiTCAAGTC CTGT CGTTGAGAGTACT 
TCTACTAAGTTAACTGAAGAGACITACAAACAAAAAGATGGTAA^GAT^ 
AGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTTAATA 
TGGCATACGATATTATTGCTAAAGAAAACCCATCTTTAAATGCAGTCATT 
ACTACTAGACG C CAAGAAG CTATTGAAGAGGCTAGAAAACTTAAAGATAC 
CAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGC^ 
GTATTAAAGGTGGTGAAAC CAATAATGG CTTGATCTATG CAGATGGAAAA 
ATTAGCACATTTGACAGTAGC TATG TC^AAAAATATAAAGATTTAGGATT 
TATTATTTT AGGACA^CGAACTTT C CAGAGTATGGGTGGCGTAATATAA 
CAGATTCTAAATTATACGGT CTAACG CATAATC CTTGGGATCTTGCTCAT 
AATGCTGGTGG CTCTT CTGGTGGAAGTG CAGCAG CCATTG CTAGCGGAAT 
GACGCCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCAT 
CTTCTTGGACGGGCTTGGTAGGTTTAAAACCAACAAGAGGATTGGTG 
CATGAAAAG C CAGATTCGTATAGTACAGCAGTTCATTTTC CATTAACTAA 
GTCATCTAGAGACGCAGAAACATTATTAACTTATCTAAAGAAAAGCGATC 
AAACG CT AGT AT CAGTTAATGATTTAAAATCTTTACCAATTGCTTATACT 
TTGAAAT CAC CAAT GGGAACAG AAGTTAGT CAAGATG CTAAAAACG CTAT 
TATGGACAACGT CACATT CTT AAGAAAACAAGGATTCAAAGTAACAGAGA 
TAGAC3?TACC^TTGATGGTAGAGCATTAATGCGTGATTATTCAACCTTG 
GCTATTGGCATGGG AgGAG CTTTTT CAACaATTGAAAAAG AcTTAaAAAA 
AcATGGTTTTACTAAAGAAGACGTTGATCCTATTACTTG<3GC^GTTCATG 
TTATTTATCAAAATTCAGATAAGG CTG AACHTAAG AAATCTATTATGGAA 
G CCCAAAAACATATGGATGATTAT CGTAAGG CAATGGAGAAG CTTCACAA 
GCAATTTCCTATTTTCTTATCGCCAACGACCGCAAGTTT 
ATACAGATCCIATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATATG 
GAAAACTTGAG C CLAAGAAG AAAGAATTGCT CT CTTTAATCGCCAGTGGGA 
GCCTATGTTGCGTAGAACACCTTTTAG&CAA^ 

TCC CAGCTAT CAGTATC CCGACTT ACTTAT CTGAGTCTGGTTTACCCATA 

gGGACgATGTTAATGGCAGGTGazy\ACTATGATATGGTATTAATTAAATT 

TGCAAC1TTCTTTGAAAAACATCATGGTTTTAATGTTAAATGG 

TAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACTAAC 

TCC CT CTTTAAAGCTCATTCATCATTAGTAA^TTTAGAA 

AGTTACT CAAGT AT CTATCT CTAAAAAATGGATGAAATCGTCTGTTAA^A 

ATAAAC CATCCGTAATGGCATATCAAAAAG CA 

SEQ ID NO: 4811 
STRAIN 1169NT 

AATAGTACTG AGACAAGTG CTT CLA.GTAG CTCCTACTACAAATACTATCGT 
TCAAACT AATGACAGTAAT C CTAC CGCAAAATTTG CATCAGAAT CAGGAC 
AATCTGTAATATGTCAAGTAAAACCAGATAATTCTGCGGCGCTTACAACA 
GTTGACACG C CTCATATTT CAG CTCCAGAT GATTTAAAAACAACT CAATC 

AACAAAAAG ATGGTCAAGATTT AG CCAACATGGTGAG AAGTGGT CAAGTT 
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Table 48: Comparative Sequences relating to SAG1474 



ACTAGTGAGGAACTCGTCAATATGGCATACGATATTATTGCTAAAGAAAA 
CCCTT CTTTAAATG CAGTCATT ACT ACTAG ACGC CAAGAAGC CATTGAAG 
AGGCTAGAAAACTTAAAGATACTAATCAGCCATTTTTAGGTGTTCCCTTG 
TTAGTCAAGGGGTTAGGGCACAGTATTAAAGGTGGTGAAACCAATAATGG 
CTTG ATCTATGCAGATGGAAAAAT t aG CACATTTG ACAGT AG CTATGTCA 
AAAAATATAAAGATTTAGGATTTATTATTTTAGGACAAACGAACTTTCGA. 
GAGTATGGGTGGCGTAATATAACAGATTCTAAATTATACGGTCCAACGCA 
TAACCCTCGGAATCTTGCTCATAATGCTGGTGGCTCTTCTGGTGGAAGTG 
CAGC^GCCATTGCTAGCGGrATGACGCCAATTGCTAGCGGTAGTGATGCT 
GGTGGTTCTATCCGtATTCCATCTTCTTGGACGGGCITGGTAGGTTTAAA 
ACCAACAAGAGGATTGGTGAGTAATGAAAAGCCAGATTCGTATAGTACAG 
CAGTT CATT TTC CATTAAC TAAGT CATCTAGAG ACGCAGAAACATTATT A 
ACTTATCTAAAGAAAAGCGATCAAACGCTAGTATCAGTTAATGATTTAAA 
ATCTTTACCAATTGCTTATACTTTGAAATCACCAATGGGAACAGAAGTTA 
GT CAAGATG CTAAAAACGCT ATTATGGACAACGT CACATTCTTA^GAAAA 
GAA.GGATTCAAAGTAAGA.GAGATAGACTTAC CAATTG ATGGT AGAG CATT 
AATG CGTGATTATTCAACCTTGGCTATTGG CATGGGAGGAGCTTTTT CAA 
CAATTGAAAAAGACTTAAAAAAACATGGTTTTACTAAAGAAGACGTTGAT 
CCTATTACTTGGG CAGTT CATGTTATTTAT CAAAATT CAG ATAAGG CTGA 
ACITAAGAAATCTATTATGGA^G CC CAAAAA.CATATGGATGATTATCGTA 
AGGCAATGG AGAAG CITCAGAAGCAATTT CCTAT TTTCTTAT CG C CAACG 
ACCGCAAGTTTAGCCCCTCTAAATACAGAt CCATATGTAACAGAGGAAGA 
TAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAAGAAGAAAGAATTG 
CT CTCTTTAATCG C CAGTGGGAG C CTATGTTG CGTAG AACACCTTTTACA 
CAAATTGCTAATATGACAGG ACTC CCAG CT AT CAGTAT CC CG ACTTACTT 
AT OTGAGTCTGGTTTAC CCATAGGGACGATGTTAATGGCAG^TG CAAACT 
ATGATATGGTATTAATT AAATTTG CAACTTT CTTTGAAAAACAT CATGGT 
TTTAATGTTAAATGG CAAAGAATAATAGAT AAAGAAGTGAAACCAT CTAC 
TGGCCTAATACAGCCTACTAACTC C CTCTTTAAAG CT CATTCATCATTAG 
TAAATTTAGAAGAAAATTCACAAGTTACTCAAGTATCTATCTCTAAAAAA 
TGGATGAAATCGTCTGTTAAAAATAAACCATCCGTAATGGCATATCAAAA 
AGCA 

SEQ ID HO: 4812 
STRAIN JM9130013 

TTCAGTAGCT CCTACTACAAATACT ATCGTTCAAACT AATGACAGTAAT C 

CTACCGCAAAATTTT CAT CAGAATCAGGACAAT CTGTAATAGGTCAAGTA 

AAAC(^GCTAATTCTGTGGCGCTTACAACAGTTGACACGCCTCATATTTC 

AGCTCCAGATGCTTTAAAAACAACTCAATCAAGTCCrGTCGTTGAGAGTC 

CTTCTACTAAGTTAACTGAAGAGACATACAAACAAAAAGATGGTCAAGAG 

TTAG C CAACATGGTGAGAAGTGGT CAAGTTACTAGTGAGGAACT CGTCAA 

TATGGCATACGATATTATTGCTAAAGAAAACCCATCTTTAAATGCAGTCA 

TTACTACTAGAC GCCAAGAAGCTATTGAAGAGG CTAGAAAACTT AAAGAT 

ACCAATCAGCCX5TTTTT , AGGTGTTCCCITGTTAGTCAAGG 

CAGTATTAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGGTGGAA 

AAATTAG CACATTTGACAGTAG CTATGT CAAAAAATATAAAGATTTAGGA 

TTTATTATTTTAGGACAAACGAAC TTT CCAGAGTATGGATGG CGCLA^TAT 

AACAGATTCTAAATTATACGGTC CAACGCATAAC C CTTGGAATCTTGCT C 

ATAATGCTGGTGGCT CTTCTGGTGGAAGTG CAG CAGTTATTGCTAG CGGG 

ATGACGCCAATTG CTAG CGGTAGTGATGCTGGTGGTT CTATC CGTATTC C 

ATCIT CTTGG ACGGG CTTGGTAGGTTTAAAAC CAACAAGAGGATTGGTGA 

GTAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTA^CT 

AAGT CATCTAGAGACGCAGAAACATTATTAACITAT CGA 

TCAAACG CT AGT AT CAGTTAATGATTT AAAAT CTTTAC CAATTG CTTATA 

CTTTGAAAT CAC CAATGGGAACAGAAGTTAGT CAAGATG CTAAAAATGCT 

ATTATGGACAACGTCATATTCTTAAGAAAACAAGGATTCA 

GATAG ACTT ACCAATTGATGGTAGAG CATTAATG CGTGATTATTCAACCT 

TGGCTATTGGTATGGGAGGAGCTTTTTCAACAATTGAAAA^ 

AAACATGGTTTTACTAAAGAAGACGTTGAT CCCATT^ CA 

TGTTATTTATCAAA^TTCAGATAAGGCTGAACTTAAGAA^TCTATTATGG 

AAGCC CA^AAACAT ATGGATGATT ATCGTAAGGCAATGGAGAAG CTT CAC 

AAGCAATTTCCTATTTTCTTATCGCCAACGACCG 

AAATAC^GATCCATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATA 
TGGAAA^CTTGAGCCAAGAAGAAAGAATTGCTCTCTTTAATCGCCAGTGG 
GAG C CTATGTTG CGTAG AACAC CTTTTACACAAATTG CTAATATG ACAG G 
ACTCCCAGCTATCAGTATCCCGACTTACrT^ 

TAGGGACGATGTTAATGGCAGGTGCAAACTATGATATGGTATTAATTAAA 
TTTGCAACTTTCTJ." i' GAAAAATAT CATGGTTTTAATGTTAAATGG CAAAG 
AATAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAG C CTACTA 
ACTCCCT CTTTAAAG CT CATTCAT CATTAGTAAATTTAGAAGAAAATTCA 
CAAGTTACT CAAGTATCTAT CT CTAAAAAATGGATGAAAT CGTCTGTTAA 
AAATAAACCATC CGTAATGG CATAT 

SEQ ID NO: 4813 
STRAIN H36B 

CTT CAGTAGTT CCTACTACAAATACTAT CGTT CAAACTAATGACAGT AAT 
CCTACCGCAAAATTTT CAT CAGAAT CAGGACAAT CTGTAATAGGT CAAGT 
AAAACCAGCTAATTCTGTGGCGCTTACAACAGTTGACACGCCTCATATTT 
CAGCTCCAGATG CITTAAAAACAACTCAAT CAAGTCCTGT CGTTG AG AGT 
CCTT CTACTAAGTTAACTG AAGAGACAT ACAAACAAAAAGATGGTCAAG A 
TTTAGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTCA 
ATATGGCAT aCX?ATA t TATTGCTAAAGAAAACCCATCTTTAAATG CAGTC 
ATTACTACTAGACG C CAAGAAG CT ATTGAAGAGG CTAGAAAACTTAAAGA 
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Table 48: Comparative Sequences relating to SAG1474 



TACC7VATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGC 
ACAGTATTAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGGTGGA 
AAAATTAGCACATTTGACAGTAGCTATGTCAAAAAATATAAAGATTTAGG 
ATTTATTATTTTAGGACAAACGAACTTTCCAGAGTATGGATGGCGCAATA 
TAACAGATTCTAAATTATACGGTCCAACGCATAACCCTTGGAATCTTGCT 
CATAATGCTGGTGGCTCTTCTGGTGGAAGTGCAGCAGTTATTGCTAGCGG 
GATGACX3CCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTC 
CATCTTCTTGGACGGGCTTGGTAGGTTTAAAACCAACAAGAGGATTGGTG 
AGTAATGAAAAGCGAGATTCGTATAGTACAGCAGTTCATTTTCCATTAAC 
TAAGT CATCT AG AGACG CAG AAACATTATT AACTTATCT AAAGAAAAGCG 
AT CAAACG CTAGTAT CAGTT AATG ATTTAAAATCT TT ACCAATTGCTTAT 
ACTTT GAAATCACCAATGGGAA.CAG AAGTTAGT CAAG ATG CTAAAAATG C 
TATTATGGACAACGTCATATTCTTAAGAAAACAAGGATTCAAAGTGACAG 
AG ATAGACTTAC CAATTGATGGTAGAG CATTAATG CGTGATTATTCAAC C 
TTGGCTATTGGTATGGGAGGAGCTTTTTCAACAATTGAAAAAGACTTAAA 
AAAACATGGTTTTACTAAAGAAGACGTTGATCCCATTACTTGGGCAGTTC 
ATGTTATTTATCAAAATT CAGATAAGG CTG AACTTAAGAAATCTATTATG 
GAAG CC CAAAAACAT ATGGATG ATTAT CGTAAGG CAATGGAGAAGCTTCA 
CAAGCAATTTCCTATTTTCTTATCGCCAACGACCGCAAGTTTAGCCCCTC 
TAAATACAGATC CATATGTAACAG AGGAAG ATAAAAGAG CGATTTATAAT 
ATGGAAAACTTGAG C CAAGAAGAAAGAATTGCT CTCTTT AATCGCCAGTG 
GGAG C CTATGTTGCGTAGAACACCTTTTACACAAATTGCTAATATGACAG 
GACTCCCAGCrTATCAGTATCCCGACTTACTTATCTGAGTCTGGTTTACCC 
ATAGGGACG ATGTT AATGG CAGGTG CAAACTATG ATATGGTATTAATTAA 
ATTTGCAACTTTCTTTGAAA^TATCATGGlUU-lAATGTTAAATGGCAAA 
GAATAATAGATAAAGAAGTGAAAC CATCTACTGG CCTAATACAGC CT ACT 
AACT C CCTCTTTAAAGCTCATTCAT CATTAGTAAATTTAGAAGAAAATT C 
ACAAGTTACTCAAGTAT CTATCTCTAAAAAATGG ATGAAATCGT CTGTTA 
AAAATAAA 

PRETTY of : /biotmp/msa71927 . 2 { * } January 22, 2003 07:23 

1 

msa71927 . 2{l73__18RS2l} aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 
msa71927.2{l73_2603} aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 

msa71927.2{l73_A909} — TACTACAA 

msa71927 .2{173_090} aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 

msa71927 . 2 { 173_CJB110 } tagtt CCTACTACAA 

msa71927.2{l73_COHl} aatagtactg agacaagtgc ttcagtagct ccTACTACAA 

msa71927.2{l73_M78l} tgc ttcagtagct ccTACTACAA 

msa71927.2{l73_M732} -tcagtagct CcTACTACAA 

msa71927.2{l73_H36B} c ttcagtagtt ccTACTACAA 

msa71927.2{l73_JM9130013} ttcagtagct ccTACTACAA 

msa71927 .2{l73_1169NT} aatagtactg agacaagtgc ttcagtagct ccTACTACAA 
Consensus _-******** 



50 

ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
ATACTATCGT 
********** 



51 100 

msa71927.2{l73_18RS2l} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC 

msa71927.2{l73_2603} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC 

msa71927.2{l73_A909} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC 

msa71927 .2{l73_090 ) TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC 

msa71927.2{l73_CJB110} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC 

msa71927.2'{l73_COHl} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTg cATCA GAATCAGGAC 

msa71927 . 2 {173_M781} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTg cATCA GAATCAGGAC 

msa71927.2{l73_M732) TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTg CATCA GAATCAGGAC 

msa71927.2{173_H36B} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTtcATCA GAATCAGGAC 

msa71927.2{l73_OM913 0013} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTtcATCA GAATCAGGAC 

msa71927.2{l73_1169NT} TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgcATCA GAATCAGGAC 

Consensus ********** ********** ********** ****__**** ********** 



msa71927.2{ 

msa71927. 

msa71927. 
msa71927 
msa71927.2{ 

msa71927. 

msa71927. 

msa71927. 

msa71927 
msa71927.2{l73 
msa71927.2{ < 



173_18RS2l] 
2{l73_2603] 
2{173_A909] 
2{173_090] 
173_CJB110 : 
2{l73_COHl] 
2{173_M78lj 
2{173_M732] 
2{173_H36B 
JM9130013 
l'73_1169NT) 
Consensus 



101 

AAT CTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
AATCTGTAAT 
********** 



AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AgGTCAAGTA 
AtGTCAAGTA 



AAACCAGaTA 
AAACCAGaTA 
AAACCAGaTA 
AAACCAGaTA 
AAACCAGaTA 
AAACCAGcTA 
AAACCAGcTA 
AAACCAGCTA 
AAACCAGcTA 
AAACCAGcTA 
AAACCAGaTA 



*_******** ********** 



ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGcGGC 
ATTCTGtGGC 
ATTCTGtGGC 
ATTCTGcGGC 
******_*** 



150 

GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
GCTTACAACA 
********** 



msa71927 .2 
msa71927 
msa71927 
msa7192 

msa71927 .2 
msa71927 
msa71927 
msa71927 



{l73_18RS2l} 
.2{173_2603} 
.2{173_A909) 
7.2{l73_090} 
{173_CJB110} 
.2{l73_COHl} 
.2{173_M781> 
•2{173_M732} 



151 

GTTGACACGC 
GTTGACACGC 
GTTGACACGC 
GTTGACACGC 
GTTGACACGC 
GTTGACACGC 
GTTGACACGC 
GTTGACACGC 



CtcaTCATAT 
Ctc aTCATAT 
CtcaTCATAT 
CtcaTCATAT 
CtcaTCATAT 
C - - . TCATAT 

C TCATAT 

C . . . TCATAT 



TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 
TTCAGCTCCA 



GATG CTTTAA 
GATGcTTTAA 
GATG cTTTAA 
GATGcTTTAA 
GATGcTTTAA 
GATGcTTTAA 
GATGcTTTAA 
GATGcTTTAA 



200 

AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
AAACAACTCA 
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Table 48: Comparative Sequences relating to SAG1474 



ms a7 1 92 7 . 2 { 1 7 3_H3 6B } GTTGACACGC C . . . TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA 

msa71927.2{173_JM9130013} GTTGACACGC C... TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA 

msa71927.2{l73__1169NT} GTTGACACGC C... TCATAT TTCAGCTCCA GATGaTTTAA AAACAACTCA 

Consensus ********** * ****** ********** ****_***** ********** 



201 

msa71927.2{l73_18RS2l} ATCAAGTCCT 

msa71927.2{l73_2603} ATCAAGTCCT 

msa71927.2{l73_A909} ATCAAGTCCT 

msa71927 .2{173_090} ATCAAGTCCT 

msa71927.2{l73jZJB110} ATCAAGTCCT 

msa71927.2{l73_COHl} ATCAAGTCCT 

msa71927.2{l73_M78l} ATCAAGTCCT 

msa71927.2(173_M732} ATCAAGTCCT 

msa71927.2{l73__H36B} ATCAAGTCCT 

tnsa71927.2{l73_JM9130013 J ATCAAGTCCT 

msa71927.2{l73_1169NT) ATCAAGTCCT 

Consensus ********** 



GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 
GTCGTTGAGA 



GTaCTTCTAC 
GTaCTTCTAC 
GTaCTTCTAC 
GTaCTTCTAC 
GTaCTTCTAC 
GTcCTTCTAC 
GTcCTTCTAC 
GTcCTTCTAC 
GTcCTTCTAC 
GTcCTTCTAC 
GTaCTTCTAC 



********** **_******* 



TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
TAAGTTAACT 
********** 



250 

GAAGAGAC t T 
GAAGAGAC t T 
GAAGAGAC tT 
GAAGAGAC tT 
GAAGAGAC tT 
GAAGAGACaT 
GAAGAGAC aT 
GAAGAGACaT 
GAAGAGACaT 
GAAGAGACaT 
GAAGAGACaT 
********_* 



msa71927.2{ 

msa71927. 

msa71927 
msa71927 
msa71927.2{ 

msa71927 . 

msa71927 . 

msa71927 . 

msa71927 . 
msa71927.2{l73 
rasa71927.2{' 



173_18RS2l} 
2{173_2603* 
2{173_A909] 
2{173_090] 
173_CJB110j 
2{l73_COHl] 
2{173__M781 ! 
2{l73_M732 ? 
2{173_H36B] 
_JM9130013] 
173_1169NT} 
Consensus 



251 

ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
ACAAACAAAA 
********** 



AGATGGTcAA 
AGATGGTcAA 
AGATGGTcAA 
AGATGGTaAA 
AGATGGTaAA 
AGATGGTcAA 
AGATGGTcAA 
AGATGGTcAA 
AGATGGTcAA 
AGATGGTcAA 
AGATGGTCAA 
*******_** 



GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAtTTAGCCA 
GAgTTAGCCA 
GAtTTAGCCA 

**w******* 



ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
ACATGGTGAG 
********** 



300 

AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
AAGTGGTCAA 
********** 



msa71927 . 2 { 173_18RS21 } 
msa71927 . 2 { 173_2603 } 
msa71927 . 2 {173_A909 } 
msa71927 . 2 { 173_090 } 
msa71927 . 2 { 173_CJB110 } 
msa7 192 7 . 2 { 173_COHl } 
msa71927 . 2 {173JM781 } 
msa71927 . 2 {l73_M732 } 
msa71927.2{l73JH36B} 
msa71927.2{l73_JM9130013} 
msa71927 . 2 { 173_11S9NT} 
Consensus 



301 

GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 
GTTACTAGTG 



AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 
AGGAACTCGT 



********** ********** 



tAATATGGCA 
tAATATGGCA 
tAATATGGCA 
tAATATGGCA 
tAATATGGCA 
cAATATGGCA 
cAATATGGCA 
cAATATGGCA 
CAATATGGCA 
cAATATGGCA 
cAATATGGCA 
_********* 



TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
TACGATATTA 
********** 



350 

T t GCTAAAGA 
TtGCTAAAGA 
TtGCTAAAGA 
TtGCTAAAGA 
TtGCTAAAGA 
TcGCTAAAGA 
TcGCTAAAGA 
TcGCTAAAGA 
TtGCTAAAGA 
TtGCTAAAGA 
TtGCTAAAGA 
*_******** 



351 400 

msa71927.2{l73_18RS2l} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_2603} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{173_A909} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_090} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_CJB110} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_COHl} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG 

msa71927.2{l73_M78l} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG 

msa71927 .2| 173_M732} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG 

msa71927.2{173_H36B} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_JM913 0013} AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG 

msa71927.2{l73_1169NT} AAACCCtTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG 

Consensus ******-*** ********** ********** ********** *****_**** 



401 

msa71927.2{l73_18RS2l} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927 . 2{l73_2603} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{l73_A909} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{!73__090} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{l73_CJB110 j AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{l73_COHl} AAGAGGCTAG AAAACTTAAA GATACtAATC 

msa71927.2{l73_M78l} AAGAGGCTAG AAAACTTAAA GATACtAATC 

msa71927.2(l73_M732l AAGAGGCTAG AAAACTTAAA GATACtAATC 

maa71927.2{l73_H36B} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{l73__OM913 0013} AAGAGGCTAG AAAACTTAAA GATACcAATC 

msa71927.2{l73_1169NT} AAGAGGCTAG AAAACTTAAA GATACtAATC 

Consensus ********** ********** *****_**** 



AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCgTTTTT 
AGCCaTTTTT 
****_***** 



450 

AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
AGGTGTTCCC 
********** 



451 

msa71927.2{l73_18RS2l} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa71927.2fl73_2603| TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa71927 .2{l73_A909> TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa71927.'2{l73_090} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa71927 . 2 { 173_CJB110 } TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa7 1927 . 2 1 173_C0H1 ) TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 

msa71927.2{173_M78l} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG 



500 

AAACCAATAA 
AAACCAATAA 
AAACCAATAA 
AAACCAATAA 
AAACCAATAA 
AAACCAATAA 
AAACCAATAA 



820 



WO 2004/018646 
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Table 48: Comparative Sequences relating to SAG1474 



msa71927 .2(l73_M732} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA 

msa71927.2{173_H3 6B} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA 

msa7 1927 .2(173 JM913 0013} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA 

msa71927.2(173_1169NT} TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA 

Consensus ********** ********** ********** ********** ********** 

501 550 

msa71927.2(173_18RS2l} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_2603) TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_A909} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_090} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_CJB110} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2fl73_COHlj TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927 .2{173_M781} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_M732j TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_H36B} TGGCTTGATC TATGCAGgTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2(173_JM9130013} TGGCTTGATC TATGCAGgTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

msa71927.2{l73_1169NT} TGGCTTGATC TATGCAGaTG GAAAAATTAG CACATTTGAC AGTAGCTATG 

Consensus ********** *******_** ********** ********** ********** 



551 

msa71927.2{l73_18RS2lJ TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_2603) TCAAAAAATA TAAAGATTTA 
msa71927.2{l73_A909) ' TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_090} TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_CJB110} TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_COHl} TCAAAAAATA TAAAGATTTA 

msa71927.2(173_M781} TCAAAAAATA TAAAGATTTA 

msa71927.2{173_M732} TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_H36B} TCAAAAAATA TAAAGATTTA 

msa71927.2{l73_JM9130013} TCAAAAAATA TAAAGATTTA 

rasa71927.2{l73_1169NT} TCAAAAAATA TAAAGATTTA 

Consensus ********** ********** 



GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
********** 



TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
********** 



600 

AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAtTTT 
AACGAAtTTT 
AACGAAtTTT 
AACGAAcTTT 
AACGAACTTT 
AACGAAcTTT 
******„*** 



w *" * boll 

msa71927.2{173_18RS2lJ CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCtAAC 

msa71927 . 2 { 173_2603 } CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCtAAC 

msa71927.2{l73_A909} CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCtAAC 

!Dsa71927.2{l73_090} CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCtAAC 

msa71927.2{l73_CJB110} CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCtAAC 

msa71927.2{l73_COHl} CCAGAGTATG GgTGGCGtAA TATAACAGAc TCTAAATTAT ACGGTCcAAC 

msa71927.2{l73_M78l} CCAGAGTATG GgTGGCGtAA TATAACAGAc TCTAAATTAT ACGGTCcAAC 

msa71927.2(l73_M732} CCAGAGTATG GgTGGCGtAA TATAACAGAc TCTAAATTAT ACGGTCnAAC 

msa71927.2{173_H3SB} CCAGAGTATG GaTGGCGcAA TATAACAGAt TCTAAATTAT ACGGTCcAAC 

msa71927.2{l73_JM913 0013} CCAGAGTATG GaTGGCGcAA TATAACAGAt TCTAAATTAT ACGGTCcAAC 

msa71927.2{l73_1169NT} CCAGAGTATG GgTGGCGtAA TATAACAGAt TCTAAATTAT ACGGTCcAAC 

Consensus ********** *_*****_** *********_ ********** ******_*** 



, 651 700 

msa71927.2{173_18RS2l} GCATAAt CCT tGGgATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2fl73_2603| GCATAAtCCT tGGgATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2{l73_A909} GCATAAtCCT tGGgATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2{l73_090} GCATAAtCCT tGGgATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927 .2{l73_CJB110} GCATAAtCCT tGGgATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927 . 2 ( 173_COHl } GCATAAtCCT tGGaATCTTG CTCATAAcGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2{173_M78l} GCATAAtCCT tGGaATCTTG CTCATAAcGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2{l73_M732} GCATAAtCCT tGGgATCTTG CTCATAAcGC TGGTGGCTCT TCTGGTGGAA 

msa71927 .2{173_H36B} GCATAAcCCT tGGaATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927 . 2 {l73_JM9130013 } GCATAAcCCT tGGaATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

msa71927.2{l73_1169NT} GCATAAcCCT cGGaATCTTG CTCATAAtGC TGGTGGCTCT TCTGGTGGAA 

Consensus ******-*** _**^****** *******_** ********** ********** 



701 750 

msa71927.2{l73_JL8RS2l} GTGCAGCAGc cATTGCTAGC GGaATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2fl73_2603] GTGCAGCAGc cATTGCTAGC GGaATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2{173_A909) GTGCAGCAGc cATTGCTAGC GGaATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2{173_090} GTGCAGCAGc cATTGCTAGC GGaATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2{l73_CJB110} GTGCAGCAGc cATTGCTAGC GGaATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2fl73_COHl} GTGCAGCAGc tATTGCTAGC GGaATGACGC CAATTGCTAG CGGcAGTGAT 

msa71927.2|173_M78lJ GTGCAGCAGc tATTGCTAGC GGaATGACGC CAATTGCTAG CGGcAGTGAT 

msa71927.2{173_M732} GTGCAGCAGc tATTGCTAGC GGaATGACGC CAATTGCTAG CGGcAGTGAT 

msa71927.2{173_H36B} GTGCAGCAGt tATTGCTAGC GGgATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2{l73_JM9130013} GTGCAGCAGt tATTGCTAGC GGgATGACGC CAATTGCTAG CGGtAGTGAT 

msa71927.2{l73_1169NT} GTGCAGCAGc cATTGCTAGC GGrATGACGC CAATTGCTAG CGGtAGTGAT 

Consensus *********_ _********* **_******* ********** ***_****** 



f , ' JJ - 800 

rasa71927.2{173_18RS2lj GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TgGTAGGTTT 

msa71927.2{l73_2603) GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TgGTAGGTTT 

msa71927.2{l73_A909} GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TgGTAGGTTT 

msa71927.2{l73_090} GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TgGTAGGTTT 

msa71927.2{l73_CJB110) GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TgGTAGGTTT 

msa71927.2{l73__COHl} GCTGGTGGTT CTATCCGTAT TCCATCTTCT TGGACGGGCT TaGTAGGTTT 



821 



WO 2004/018646 
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Table 48: Comparative Sequences relating to SAG1474 



msa71927.2{l73_M78l} 
msa71927.2{173_M732} 
msa71927 .2{l73_H36B} 
msa71927 . 2 { 173_JM9130013 } 
msa71927.2{l73_1169NT} 
Consensus 



GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 



CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 



********** ********** 



TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
********** 



TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 



TaGTAGGTTT 
TaGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 



********** *„******** 



msa71927.2{ 

msa71927. 

msa71927 . 
msa71927 
msa71927.2{ 

msa71927 . 

msa71927. 

msa71927 . 

msa71927 . 
msa71927.2{l73 
msa71927.2{ 



173_18RS2l) 
2{173_2603} 
2{173__A909} 
,2{173_090 T 
173_CJB110' 
2{l73_COHl] 
2{173_M781] 
2{173_M732' 
2(173_H36Bj 
JM9130013' 
173_1169NT] 
Consensus 



801 

AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 
AAAACCAACA 



AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 



TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTcATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 



AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 
AAAGCCAGAT 



********** ********** *****_**** ********** 



850 

TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
TCGTATAGTA 
********** 



msa71927.2{ 
msa71927 . 
msa71927 
msa71927 

msa71927.2{ 
msa71927 
msa71927 . 
msa7l927 
msa71927 
msa71927.2{l73 

rnsa71927.2{ 



173_18RS2l} 
2 { 173^2603} 
2{173_A909} 
.2{173_090} 
173_CJB110} 
2(l73_COHl} 
2{173_M781) 
2{173_M732} 
2{173_H36B} 
_JM9130013} 
173_1169NT} 
Consensus 



851 

CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
********** 



TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
TTTTCCATTA 
********** 



ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
ACTAAGTCAT 
********** 



CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
********** 



900 

AGAAACATTa 
AGAAACATTa 
AGAAACATTa 
AGAAACATTa 
AGAAACATTa 
AGAAACATTg 
AGAAACATTg 
AGAAACATTg 
AGAAACATTa 
AGAAACATTa 
AGAAACATTa 
*********_ 



901 

msa71927.2{l73_18RS2l} TTAACTTAtC TAAAGAAAAG 

msa71927.2{l73_2603} TTAACTTAtC TAAAGAAAAG 

msa7 1927 .2(17 3_A909) TTAACTTAtC TAAAGAAAAG 

msa71927.2{l73_090) TTAACTTAtC TAAAGAAAAG 

msa71927.2{l73_CJB110} TTAACTTAtC TAAAGAAAAG 

msa71927 .2{173_C0H1} TTAACTTAcC TAAAGAAAAG 

msa71 927 .2(173 M78lJ TTAACTTAcC TAAAGAAAAG 

msa7 192 7. 2 {1733173 2} TTAACTTAcC TAAAGAAAAG 

msa71927.2{l73_H36B} TTAACTTAtC TAAAGAAAAG 

msa71927.2{l73_JM9130013} TTAACTTAtC TAAAGAAAAG 

msa71927.2{l73_1169NT} TTAACTTAtC TAAAGAAAAG 

Consensus ********-* ********** 



CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 



CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 



950 

TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 
TTAATGATTT 



********** ********** ********** 



951 1000 

msa71927.2{l73_18RS2ll AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{l73_2603j AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{173_A909} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{l73_090} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927 ,2{l73jCJB110} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

maa71927.2{l73_COHl} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{l73_M78l} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa7 1 92 7 . 2 { 1 73_M73 2 } AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{173_H36B} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{l73_JM9130013} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

msa71927.2{l73_1169NT} AAAATCTTTA CCAATTGCTT ATACTTTGAA ATCACCAATG GGAACAGAAG 

Consensus ********** ********** ********** ********** ********** 



msa71927.2{ 
msa71927 . 
msa71927 . 
msa71927 

msa7l927.2{ 
msa71927 . 
rasa71927 . 
msa71927 
ins a7 1927 . 
msa71927.2{l73 

msa71927.2{ 



173_18RS2l} 
2{173_2603) 
2{173_A909> 
.2{l73_090} 
173_CJB110} 
2{l73_COHl] 
2{173_M781] 
2{173_M732; 
2{173_H36B] 
JM9130013* 
173_1169NT) 
Consensus 



1001 

TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
********** 



TGCTAAAAAc 
TGCTAAAAAc 
TGCTAAAAAc 
TGCTAAAAAC 
TGCTAAAAAC 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAc 
********* _ 



GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
********** 



ACAACGTCAc 
ACAACGTCAC 
ACAACGTCAc 
ACAACGTCAc 
ACAACGTCAc 
ACAACGTCAc 
ACAACGTCAC 
ACAACGTCAc 
ACAACGTCAt 
ACAACGTCAt 
ACAACGTCAC 
*********_ 



1050 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
ATTCTTAAGA 
********** 



msa71927 . 2 { 173_18RS2l) 
msa7 1927 . 2 { 173_2603 } 
msa7 1 9 2 7 . 2 { 173_A9 09 J 
msa71927.2{l73_090) 

tnsa71927 . 2 { 173_CJB110} 



1051 1100 

aAACAAGGAT TCAAAGTaAC AGAGATAGAc TTACCAATTG ATGGTAGAGC 

aAACAAGGAT TCAAAGTaAC AGAGATAGAc TTACCAATTG ATGGTAGAGC 

aAACAAGGAT TCAAAGTaAC AGAGATAGAc TTACCAATTG ATGGTAGAGC 

aAACAAGGAT TCAAAGTaAC AGAGATAGAc TTACCAATTG ATGGTAGAGC 

aAACAAGGAT TCAAAGTaAC AGAGATAGAc TTACCAATTG ATGGTAGAGC 



822 



WO 2004/018646 



PCT/US2003/026827 



Table 48: Comparative Sequences relating to SAG1474 



msa71927 . 2 jl73_COHl} aAACAAGGAT 

msa71927.2{l73__M781} gAACAAGGAT 

msa71927.2{l73_M732} aAACAAGGAT 

msa71927.2{l73_H36B} aAACAAGGAT 

msa71927.2{l73_JM9130013} aAACAAGGAT 

msa71927.2{l73_1169NT} aAACAAGGAT 

Consensus _********* 



TCAAAGTgAC 
TCAAAGTgAC 
TCAAAGTgAC 
TCAAAGTgAC 
TCAAAGTgAC 
TCAAAGTaAC 
********** 



AGAGATAGAt 
AGAGATAGAt 
AGAGATAGAt 
AGAGATAGAc 
AGAGATAGAc 
AGAGATAGAC 
*********_ 



TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
********** 



ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
********** 



msa71927.2{ 

msa71927 . 

msa71927 . 
msa71927 
msa71927.2{ 

msa71927 . 

msa71927 . 

msa71927 . 

msa71927 . 
msa71927.2{l73 
msa71927.2{ 



173_18RS2l} 
2{173_2603} 
2{173_A909} 
.2{l73_090) 
173_CJB110} 
2{173_C0H1} 
2{173_M781) 
2{173_M732) 
2{173_H36B} 
_JM9130013) 
173_1159NT} 
Consensus 



1101 

ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
********** 



GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
********** 



CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
********** 



TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGCATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGtATGGGA 
TGGtATGGGA 
TGGcATGGGA 
***~****** 



1150 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
********** 



msa71927.2{ 

msa71927 . 

rasa71927 . 
msa71927 
msa71927.2{ 

msa71927 . 

msa71927 . 

msa71927. 

msa71927 . 
msa71927.2{!73 
msa71927.2{ 



173_18RS2lJ 
2{l73_2603) 
2{173_A909} 
2{173_090} 
173_CJB110) 
2{l73_COHl} 
2{l73JM78l} 
2{173_M732} 
2{173_H36B} 
_JM9130013} 
173JL169NT} 
Consensus 



1151 

CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
********** 



AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
AAAAGACTTA 
********** 



AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
********** 



GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
********** 



1200 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
********** 



msa7l927.2{ 
msa71927 
msa71927 
msa71927 

msa71927.2{ 
msa71927 
msa71927 . 
msa71927 . 
msa71927 . 
msa71927.2{l73 

msa71927.2{ 



173_18RS21} 
2(173_2603) 
2{173_A909) 
.2 (173^090} 
173_CJB110} 
2{173_C0H1> 
2{173_M781} 
2{173_M732} 
2{173_H36B) 
JM9130013 J 
173_1169NT} 
Consensus 



1201 

GAT CC tATTA 
GATCCtATTA 
GATCCtATTA 
GATCCtATTA 
GATCCtATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCtATTA 
***** _ **** 



CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGCAGT 
CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGcAGT 
CTTGGGgAGT 
CTTGGGcAGT 
******„*** 



TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
TCATGTTATT 
********** 



TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
********** 



1250 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
********** 



rnsa71927.2{ 

msa71927 . 

msa71927. 
tnsa71927 
msa71927.2{ 

msa71927 . 

msa71927 . 

msa71927 . 

maa71927. 
msa71927.2{l73 
msa71927.2{ 



173_18RS2l] 
2{l73_2603] 
2 {173_A909; 

2{l73_090l 
173_CJB110; 
2{l73_COHl) 
2{l73_M78i; 
2f 173JM732 1 
2{173__H36b) 
JM9130013' 
173_1169NT) 
Consensus 



1251 

TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
TGAACTTAAG 
********** 



AAATCTATTa 
AAATCTATTa 
AAATCTATTa 
AAATCTATTa 
AAATCTATTa 
AAATCTATTg 
AAATCTATTg 
AAATCTATTg 
AAATCTATTa 
AAATCTATTa 
AAATCTATTa 
*********_ 



TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
TGGAAGCCCA 
********** 



AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
********** 



1300 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC. 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
********** 



msa71927.2{ 

msa71927 . 

msa71927. 
msa71927 
msa71927.2{ 

msa71927 . 

msa7192 7 . 

msa71927 . 

msa71927 
msa71927.2{l73 
msa71927.2{ 



173_18RS21\ 
2{173_2603} 
2{173_A909) 
2{l73_090} 
173_CJB110> 
2{173_C0H1} 
2{173_M781} 
2{173_M732} 
2{173_H3 6BJ 
__JM9130013> 
173_1169NT} 
Consensus 



1301 

, GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 



GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 



********** ********** 



CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
********** 



TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 
TTCCTATTTT 



1350 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 
CTTATCGCCA 



********** ********** 



msa71927 . 2 { 173_18RS21 } 
msa7l927,2{l73_2603} 
Kisa7 1 92 7 . 2 { 17 3_A9 09 ) 
msa71927 . 2 {173_090 } 



1351 1400 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG ' TaACAGAGgA 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 



823 



WO 2004/018646 



PCT/US2003/026827 



Table 48: Comparative Sequences relating to SAG1474 



msa71927 . 2 { 173_CJB110 ) ACGACCGCAA 

msa71927.2{l73_COHl) ACGACCGCAA 

msa71927.2{l73_M78lj ACGACCGCAA 

msa71927.2{l73_M732} ACGACCGCAA 

msa71927 .2{l73JH36B} ACGACCGCAA 

msa71927 . 2 { 173_JM9130013 } ACGACCGCAA 

tnsa71927.2{l73_1169NT} ACGACCGCAA 

Consensus ********** 



GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 



TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 



GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 



TaACAGAGgA 
TaACAGAGaA 
TaACAGAGaA 
T t ACAGAGaA 
TaACAGAGgA 
TaACAGAGgA 
TaACAGAGgA 



********** ********** ********** *_******^* 



msa71927.2{ 
msa71927 
msa71927 
msa71927 

msa71927.2{ 
msa71927 . 
msa71927. 
msa71927 . 
msa71927. 
msa71927.2{l73 

msa71927.2{ 



173_JL8RS2l} 
2 { 173^2603} 
2{173_A909} 
.2{173_090} 
173_CJB110} 
2{l73_COHl} 
2{173_M78l) 
2{173_M732} 
2{l73_H36B} 
JM9130013) 
1*73_1169Nt) 
Consensus 



1401 

AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
********** 



GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 



ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 



CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 



1450 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 



********** ********** ********** ********** 



msa71927 . 2 { 173_18RS21 } 
msa71927 . 2 {l73_2603 } 
msa71927.2{l73_A909} 
msa71927.2{l73_090* 
msa7 1927 . 2 { 173_CJB110 ' 
msa7 1927 . 2 {l73__COHl j 
msa7 1927 . 2 { 173__M781 
msa71927 . 2 { 173_M732 
msa71927 . 2 { 173__H36B] 
msa71927.2{l73_JM9130013) 
msa7192 7 . 2 { 1 73_1 1 69NT} 
Consensus 



1451 

TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 



TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 



********** ********** 



TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
********** 



TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
********** 



1500 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
********** 



msa7 1927 . 2 { 173_18RS2 1 J 
msa71927.2(l73_2603] 
msa71927.2{l73_A909} 
msa71927.2{l73_090} 
msa71927 . 2 { 173_C«JB110} 
msa71927.2{l73 COHl} 
msa71927.2{l73_M78l} 
msa71927 - 2 {173_M732 } 
msa71927 . 2 { 173__H36B} 
msa71927 . 2 {173_JM9130013 } 
msa71927 . 2 { 173_1169NT} 
Consensus 



1501 

ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACcAATTG 
ACACcAATTG 
ACACcAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
****„***** 



CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 



AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 
AGGACTCCCA 



GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 
GCTATCAGTA 



1550 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 
TCCCGACTTA 



********** ********** ********** ********** 



msa71927.2{ 

msa71927. 

msa71927. 
msa71927 
msa71927.2{ 

msa71927 . 

msa71927 . 

msa71927 . 

msa71927 . 
msa71927.2{l73 
msa71927.2{ 



173_18RS2l} 
2 { 173J2603} 
2{173_A909} 
.2{l73_090} 
173_CJB110} 
2{l73_COHl} 
2(173_M781J 
2{173_M732) 
2{173_H3 6B} 
_JM9130013} 
173__1169NT} 
Consensus 



1551 

CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
********** 



TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
********** 



CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 



GATGTTAATG 

GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 



1600 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 



********** ********** ********** 



1601 

msa71927.2{l73_18RS2l} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa7 192 7, 2 { 173_2603} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927.2{l73_A909} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927 . 2{l73_090) ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927.2{l73_CJB110} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCT TT GA 

msa71927.2{l73_COHl} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927 .2{173_M781} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927,2{l73_M732} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927.2{l73_H36B} ACTATGATAT GGTATTAATT AAATTTGCAA C TTT CTT TGA 

msa71927.2{l73_JM9130013} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

msa71927.2{l73__1169NT} ACTATGATAT GGTATTAATT AAATTTGCAA CTTTCTTTGA 

Consensus ********** ********** ********** ********** 



1650 
AAAAcATCAT 
AAAAcATCAT 
AAAACATCAT 
AAAAcATCAT 
AAAAcATCAT 
AAAACATCAT 
AAAACATCAT 
AAAAcATCAT 
AAAAtATCAT 
AAAAtATCAT 
AAAAcATCAT 
****_***** 



msa71927 . 2 { 173_18RS21 } 
msa71927 , 2 ( 173_2603 5 
msa71927.2{173_A909} 



1651 1700 
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC 
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC 
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC 



824 



WO 2004/018646 



PCT/US2003/026827 



Table 48: Comparative Sequences relating to SAG1474 



msa71927.2{l73_090} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2{l73_CJB110} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2{l73_COHl} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2{l73_M78lj GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2(l73_M732} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2{l73_H36B} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927.2{l73_JM913 0013} GGTTTTAATG TTAAATGGCA AAGAATAATA 

msa71927 . 2 { 173_1169NT} GGTTTTAATG TTAAATGGCA AAGAATAATA 

Consensus ********** ********** ********** 



GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 



TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 



********** ********** 



1701 1750 

msa71927.2{l73_18RS2l} TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_2603) TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_A909} TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa7 1 9 2 7 . 2 { 17 3_0 9 0 } TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927 ,2{l73_CJB110] TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_COHl} TgCTGaCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_M78l} TgCTGaCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_M732i TgCTGaCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927.2{l73_H36B) TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927 . 2 { 173_JM913 0013 } TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

msa71927 .2{173_1169NT} TaCTGgCCTA ATACAGCCTA CTAACTCCCT CTTTAAAGCT CATTCATCAT 

Consensus *-***_**** ********** ********** ********** ********** 



msa71927»2{ 
msa71927. 
msa71927. 
msa71927 

msa71927.2{ 
msa71927. 
msa71927. 
msa71927 
msa71927 
msa71927-2{l73 

msa71927.2{ 



173_18RS2l} 
2{173_2603} 
2{l73_A909) 
.2{173_090) 
173_CJB110} 
2{l73_COHl} 
2{173_M781} 
2{173_M732} 
2{173__H36B} 
_JM9130013} 
173_1169NT} 
Consensus 



1751 

TAGTAAATTT 
TAGTAAATTT" 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
TAGTAAATTT 
********** 



AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
AGAAGAAAAT 
********** 



TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
TCACAAGTTA 
********** 



CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
CTCAAGTATC 
********** 



1800 
TAT CT CT AAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
TATCTCTAAA 
********** 



1801 

msa71927 .2{17 3_1 8RS2 1 } AAATGGATGA AATCGTCTGT TAAAAATAAA 

ms a7 1 9 2 7 . 2 f 1 73_2 603} AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2{l73_A909} AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa7 19 27 . 2 { 1 7 3^0 9 0 } AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2{l73_CJB110> AAATGGATGA AATCGTCTGT TAAAAATAAA 

maa71927.2{l73_COHl} AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2{l73_M78l} AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa7 1927.2? 173_M732} AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2(173_H36Bj AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2{l73_JM9130013) AAATGGATGA AATCGTCTGT TAAAAATAAA 

msa71927.2{l73_1169NT} AAATGGATGA AATCGTCTGT TAAAAATAAA 

Consensus ********** ********** ********** 



ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 



1850 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 



ccatccgtaa 
ccatccgtaa 



tggcatat — 
tggcatatca 



1851 

msa71927 . 2 { 173_18RS21 } aaaagca 
msa71927.2{l73_2603} aaaagca 
msa71927.2{173_A909} aaaagca 
msa71927 .2{l73_090} aaaagca 
msa71927.2{l73_CJB110} aaaagca 
msa71927 . 2{l73_COHl} aaaagca 
msa71927 .2{173_M78l} aaaagca 
msa71927 .2{173_M732} aaaagca 

msa71927 .2{173_H36B} 

msa71927.2{l73_JM9130013) 

msa71927.2{l73_1169NT} aaaagca 
Consensus 

SEQ ID NO: 4814 

STRAIN 2603 frame: 1 

NSTETSASWPTTNT I VQTNDSNPTAKFVSESGQS VI GQVKPDNS AALTTVDTPHHI SAP 
DALKTTQSSPVVESTSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNM^ 
LNAVITTRRQEAIEEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGLI YADGKI STFD 
S S YVKKY KDLGF 1 1 LGQTNFPE YG WRN I TDSKL YGLTHNP WDLAHNAGGS SGGSAAA IAS 
GMTP I ASGSDAGGS IRI PSSWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKSSRDAETL 
LTYXKKSDQTIiVSVNDLKSLPIAYTLKSPMGTEVS 

LPIDGRAIMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFPI FLSPTTASLAPI*NTDPYVTEEDKRAI YNMENI/SQ 
EERI AliFNRQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLP IGTMLMAGANYDMVLI 
KFATFFEKHHGFNVKWQRI I DKEVKPSTGL I QPTNSLFKAHSSLVNLEENSQVTQVS ISK 
KWMKS S VKNKPSVMAYQKA 



SEQ ID NO J 4815 
STRAIN _090 frame: l 

NSTETSASWPTTNT IVQTNDSNPTAKFVSESGQS VI GQVKPDNS AALTTVDTPHHI SAP 
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DALKTTQSSPVVESTSTKLTEETYKQKDGKDLANMWS IAKENPS 
LNAVI TTRRQEA I EEARKLKDTNQ P FLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFD 
SS YVKKYKDLGF 1 1 LGQTNFPE YGWRN I TDS KL YGLTHNPWDLAHNAGGS SGGS AAAI AS 
GMTP I ASGSDAGGS I RI PSSWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKSSRDAETL 
LTYLKKSDQTLVSVNDLKSLPI AYTLKS PMGTEVS QDAKNAI MDNVTFLRKQGFKVTE I D 
LPIDGRALMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFPI FLS PTTASLAPLNTDPYVTEEDKRAI YNMENLSQ 
EERI ALFNRQWEPMLRRTPFTQIANMTGLPAI S I PTYLS E SGLP I GTMLMAGANYDMVL I 
KFATFFEKHHGFNVKWQRIIDKEVKPSTGLIQPTNSLFKAHSSLVNIiEENSQVTQVSISK 
KWMKS S VKNKPS VMAYQKA 

SEQ ID NO: 4816 

STRAIN A909 frame: 2 

TTNT I VQTNDSNPTAKF VS ESGQS V I GQVKPDNS AALTT VDT PHH I SAPDALKTTQSSPV 
VESTSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNMAYDI IAKENPS LMAVITTRRQE 
AI EEARKLKDTNQP FLGVPLLVKGLGH S I KGGETNNGL I YADGKI STFDSSYVKKYKDLG 
FIILGGTOFPEYGWRNITDSKLYGLTHNPWDLAHNAGGSSGGSAAAIASGMTPIASGSDA 
GGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPIjTKSSRDAETLLTYLKKSDQTIi 
VSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVTFLRKQGFKVTEIDLPIDGRALMRD 
YSTIiAIGMGGAFSTI ekdlkkhgftkedvdp ITWAVHVI YQNSDKAELKKS IMEAQKHMD 
DYRKAMEKIiHKQFPIFLSPTTASIjAPLNTDPYVTEEDK3^IYNMENLSQEERIALFNRQW 
EPMLRRTPFTQIANMTGLPAIS IPTYLSESGLPI GTMLMAGANYDMVL I KFATFFEKHHG 
FNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVKNKP 
SVMAYQKA 

SEQ ID NO: 4817 

STRAIN COH1 frame: 1 

NSTETSASVAPTTNT I VQTNDSNPTAKFASESGQS VI GQVKPANSAALTTVDTPH I SAPD 
ALKTTQSSPVVESPSTKLTEETYKQKDGQDIANMVRSGQVTSEELVNMAYDI IAKENPSL 
NAV I TTRRQEA I EEARKLKDTNQP FLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFDS 
S YVKKYKDLGF I ILGQTNFPEYGWRNI TDSKLYGPTHNPWNLAHNAGGSSGGSAAAI ASG 
MTPI ASGSDAGGS IRI PS SWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLL 
TYLKKSDQTLVS VNDLKSLP I AYTLKS PMGTEVSQDAKNAI MDNVTFLRKQGFKVTE I DL 
PIDGRALMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELKK 
SIVEAQKRMDYRKMTEKLHKQFPIFLSPTTASLAPLNTDPYVTEKDKRAIYNME 
ERIALFNRQWEPMLRRTPFTPIANMTGLPAISIPTYLSESGLPIGTMIjMAGANYDMVLIK 
FAT FFEKHHG FNVKWQRI IDKEVKPSADLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKK 
WMKSSVKNKPSVMAYQKA 

SEQ ID NO: 4818 

STRAIN M732 frame: 1 

SVAPTTNT I VQTNDSNPTAKFASE SGQS VI GQVKPANSAALTTVDTPH I S APDALKTTQS 
S PVVES PSTKLTEET YKQKDGQDLANMVRSGQVTS EELVNMAYD 1 1 AKENPSLNAVITTR 
RQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGLI YADGKI STFDSS YVKKYK 
DLGF 1 1 LGQTNFPEYGWRNITDSKLYGXTHNPWDLAHNAGGSSGGSAAAIASGMTPIASG 
SDAGGS IRI PSSWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLLTYLKKSD 
QTLVS VNDLKSLP I AYTLKS PMGTEVSQDAKNAI MDNVTFLRKQGFKVTE IDLPI DGRAL 
MRDYSTLAIGMGGAFST I EKDLKKHGFTKEDVDP I TWAVHVI YQNSDKAELKKS I VEAQK 
HMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEKDKRAI YNMENLSQEERI ALFN 
RQWEPMLRRTPFTPI ANMTGLPAI S I PTYLSE SGLP I GTMLMAGANYDMVL I KFATFFEK 
HHGFNVKWQRI IDKEVKPSADLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVK 
NKPSVMAYQKA 

SEQ ID NO: 4819 

STRAIN 18RS21 frame: 1 

NSTETSASWPTTNTI VQTNDSNPTAKFVSESGQSVI GQVKPDNS AALTTVDTPHH I SAP 
DALKTTQSSPVVESTSTKLTEETYKQKDGQDIANMVRSGQVTSEELVNMAYDI IAKENPS 
LNAVI TTRRQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFD 
SS YVKKYKDLGF 1 1 LGQTNFPE YGWRN I TDS KLYGLTHNPWDLAHNAGGS SGGS AAAI AS 
GMTPIASGSDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETL 
LTYLKKSDOTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVTFLRKQ/3FKVTEID 
LP I DGRALMRD YSTLAI GMGGAFST I EKDLKKHGFTKEDVDP ITWAVHVI YQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQ 
EERIALFNRQWEPMLRRTPFTQIANMTGLPAISIPTYLSESGLPIGTMLMAGANYDMVLI 
KFAT FFEKHHGFNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I S K 
KWMKS S VKNKPS VMAYQKA 

SEQ ID NO: 4820 

STRAIN M781 frame: 2 

ASVAPTTNT I VQTNDSNPTAKFASESGQS VI GQVKPANSAALTTVDTPH I SAPDALKTTQ 
SSPVVESPSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNMAYDIIAKENPSLNAVIT^ 
RRQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGLI YADGKI STFDS S YVKKY 
KDLGF 1 1 LGQ/rNFPEYGWRNITDSKLYGPTHNPWNLAHNAGGSSGGSAAAIASGMTP IAS 
GSDAGGS IRI PSSWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLLTYLKKS 
DQTLVSVNDLKSLPI AYTLKSPMGTEVSQDAKNAI MDNVTFLREQGFKVTE I DLPIDGRA 
LMRDYSTLA I GMGGAFST I EKDLKKHGFTKEDVDP ITWAVHVI YQNSDKAELKKS IVEAQ 
KHMDD YRKAMEKLHKQFP I FLS PTTAS LAPLNTDP YVTE KDKRAI YNMENLSQEERI ALF 
NRQWEPMLRRTPFTP I ANMTGLPAI S I PTYLSESGLP I GTMLMAGANYDMVL I KFATFFE 
KHHGFNVKWQRI IDKEVKPSADLIQPTNSLFKAHSSLVNLEENSQVTQVSISKKWMKSSV 
KNKP SVMAYQKA 

SEQ ID NO: 4821 

STRAIN CJB110 frame: 3 
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VPTTNTIVQTNDSNPTAKFVSESGQSVIGQVKPDNSAALTTVDTPHH^ 
PVVESTSTKLTEETYKQKDGKDIJ^NMWSGQVTSEELVNMAYDI IAKENPSLNAVITTRR 
QEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YADGKI STPDS SYVKKYKD 
LGFI I LGQTNFPEYGWRNITDSKLYGLTHNPWDLAHNAGGSSGGSAAAI ASGMTP I ASGS 
DAGGS IRI PSSWTGLVGLRPTRGLVSHEKPDS YSTAVHFPLTKS SRDAETLLTYLKKSDQ 
TLVSVNDLKSLP I AYTLKS PMGTE VSQDAKNAI MDNVTFLRKQGFKVTE I DLPIDGRALM 
RDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELKKSIMEAQKH 
MDDYRKAME KLHKQFP I FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQEERIALFNR 
QWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLPIGTMLMAGANYDMVLIKFATFFEKH 
HGFNVKWQRI IDKEVKPSTGLI QPTNSIiFKAHSSIiVNLEENSQVTQVS I SKKWMKSSVKN 
KPSVMAYQKA 

SEQ ID NO: 4822 

STRAIN 1169NT frame: 1 

NSTETSASVAPTTNT I VQTNDSNPTAKFASESGQSVI CQVKPDNSAALTTVDTPH I SAPD 
DLKTTQSSPVVESTSTKLTEETYKQKDGQDIiANMVRSGQVTSEELVNMAYDIIAKENPSL 
NAVITTRRQFJVI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFDS 
S YVKKYKDLGF 1 1 LGQTNF PE YGWRNI TDSKL YG PTHNPRNLAHNAGGS SGGS AAAI ASG 
MTPIASGSDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETLL 
TYLKKSDQTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVTFLRKQGFKVTEIDL 
PIDGRALMRDYSTLAIGMGGAFST I EKDLKKHGFTKEDVDPI TWAVHVI YQNSDKAELKK 
S IMEAQKHMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQE 
ERIALFNRQWEPMLRRTPFTQI ANMTGLPAI S IPTYLSESGLPIGTMLMAGANYDMVLIK 
FATFFEKHHGFNVKWQRI IDKEVKPSTGLIQPTNSLFKAHSSLVNLEENSQVTQVSI SKK 
WMKS S VKNKPS VMAYQKA 

SEQ ID NO: 4823 

STRAIN JM9130013 frame: 2 

SVAPTTNTIVQTNDSNPTAKFSSESGQSVIGQVKPANSVALTTVDTPHISAPDALKTTQS 
SPVVESPSTKLTEETYKQKDGQELANMWSGQVTSEELVNMAYDIIAKENPSIjNAVITTR 
RQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YAGGKI STFDSSYVKKYK 
DLGFI I LGQTNF PE YGWRNI TDSKLYGPTHNPWNLAHNAGGSSGGSAAVI ASGMTPI ASG 
SDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETLLTYLKKSD 
QTLVS VNDLKSLP I AYTLKS PMGTE VSQDAKNAI MDNVI FLRKQGFKVTEIDLP I DGRAL 
MRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWGVHVIYQNSDKAELKKSIMEAQK 
HMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQEERI ALFN 
RQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLPIGTMLMAGANYDMVLI KFATFFEK 
YHGFNVKWQR I IDKEVKPSTGLIQPTNSLFKAHSSLVNLEENSQVTQVSI SKKWMKSSVK 
NKPSVMAY 

SEQ ID NO: 4824 

STRAIN H36B frame: 3 

S VVPTTNTI VQTNDSNPTAKFS SE SGQS VI GQVKPANS VALTTVDTPHI SAPDALKTTQS 
S PVVES PSTKLTEET YKQKTX^DLANMVRSGQVTS EELVNMAYD 1 1 AKENP SLNAVI TTR 
RQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YAGGKI STFDSSYVKKYK 
DLGFI I LGQTNFPEYGWRNI TDSKLYGPTHNPWNLAHNAGGSSGGSAAVI ASGMTP I ASG 
SDAGGS I RI PS S WTGLVGLKPTRGLVSNEKPD S YSTAVHFPLTKS SRDAETLLTYLKKSD 
OJTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVIFLRKO^FKVTEIDLPIDGRAL 
MRDYSTLAI GMGGAFSTI EKDLKKHGFTKEDVDP I TWAVHVI YQNSDKAELKKS IMEAQK 
HMDDYRKAMEKLHKQFPI FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQEERIALFN 
RQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLP IGTMLMAGANYDMVLI KFATFFEK 
YHGFNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVK 
NK 



PRETTY Of: /biotmp/msa72034 . 2 { *} January 22, 2003 07:25 

1 

msa72034 . 2{l73_090} nstetsasw pTTNTIVQTN DSNPTAKFvS 

msa72034.2{l73__18RS2l} nstetsasw pTTNTIVQTN DSNPTAKFvS 

msa72034 . 2 { 173_2603 } nstetsasw pTTNTIVQTN DSNPTAKFvS 

msa72034.2{l73_A909} -TTNTIVQTN DSNPTAKFvS 

msa72034 .2{173_CJB110} v pTTNTIVQTN DSNPTAKFvS 

msa72034 .2f 173_COHl} nstetsasva pTTNTIVQTN DSNPTAKFaS 

msa72034 . 2 { 173_M732 } sva pTTNTIVQTN DSNPTAKFaS 

msa72034.2{l73_M78l} asva pTTNTIVQTN DSNPTAKFaS 

msa72034.2{l73_1169NT) nstetsasva pTTNTIVQTN DSNPTAKFaS 

msa72034.2{l73_H36B} sw pTTNTIVQTN DSNPTAKFsS 

msa72034.2{l73 JM9130013} sva pTTNTIVQTN DSNPTAKFsS 

Consensus - -********* ******** _* 



ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIgQV 
ESGQSVIcQV 
ESGQSVIgQV 
ESGQSVIgQV 
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KPdNSaALTT 
KPdNSaALTT 
KPdNSaALTT 
KPdNSaALTT 
KPdNSaALTT 
KPaNSaALTT 
KPaNSaALTT 
KPaNSaALTT 
KPdNSaALTT 
KPaNSvALTT 
KPaNSvALTT 



*******_** **_**_**** 



51 100 

msa72034. 2 { 173^090} VDTphHISAP DaLKTTQSSP WEStSTKLT EETYKQKDGk dLANMVRSGQ 

msa72034.2{l73 18RS21) VDTphHISAP DaLKTTQSSP WEStSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_2603} VDTphHISAP DaLKTTQSSP WEStSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_A909} VDTphHISAP DaLKTTQSSP WEStSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_CJB110} VDTphHISAP DaLKTTQSSP WEStSTKLT EETYKQKDGk dLANMVRSGQ 

msa72034 .2{l73_COHl} VDT.pHISAP DaLKTTQSSP WESpSTKLT EETYKQKDGq dLANMVRSGQ 

msa72 034.2{173_M732} VDT.pHISAP DaLKTTQSSP WESpSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_M78l} VDT.pHISAP DaLKTTQSSP WESpSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_1169NT} VDT.pHISAP DdLKTTQSSP WEStSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{!73__H36B} VDT.pHISAP DaLKTTQSSP WESpSTKLT EETYKQKDGq dLANMVRSGQ 

msa72034.2{l73_lJM9130013) VDT.pHISAP DaLKTTQSSP WESpSTKLT EETYKQKDGq eLANMVRSGQ 
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Consensus 



msa72034.2{l73_090} 
msa72034 . 2 { 173_JL8RS21 } 
msa72034 . 2 { 173_2603 } 
msa72034 . 2 (173_A909 } 
msa72034.2{l73_CJB110} 
msa72034 . 2 { 173_C0H1 J 
msa72034 . 2 {l73_M732 } 
msa72034 , 2 { 173_M7 81 } 
msa72034 . 2 { 173_1169NT} 
msa72034.2{l73_H36B} 
msa72034.2{l73_JM9130013} 
Consensus 



msa72034.2{l73_090} 
msa72034 . 2 { 173_18RS21 } 
msa72034.2fl73_2603) 
msa72 034.2{l73_A909} 
rasa72034 . 2 { 173_CJB110 } 
tnsa72034 . 2 {173_C0H1 } 
msa72034 . 2 f 173_M732 } 
msa72034.2{173_M78l} 
msa72034 . 2 { 173_1169NT} 
msa72034 . 2 {l73_H36B} 
msa72034 . 2 {l73_JM9130013 } 
Consensus 



msa72034 . 2 { 173_090 } 
msa72034 . 2 { 173JL8RS21 } 
msa72034 . 2 ( 173_2603 } 
msa72034.2{173_A909} 
msa72034 . 2 { 173_CJB110 } 
msa72034 . 2 { 173_COHl } 
msa72034 - 2 { 173_M732 } 
msa72034 . 2 { 173_M78l} 
msa72034 . 2 { 173_1169NT} 
msa72034.2{l73_H36B} 
msa72034 . 2 {l73_JM9130013 } 
Consensus 



msa72034 . 2 { 173_090 } 
msa72034 . 2 { 173_18RS2l} 
msa72034 . 2 {l73_2603 } 
msa72034.2{l73_A909} 
rasa72034 . 2 { 173_CJB110 } 
msa7 2034.2(17 3_C0H1 } 
msa72 034 . 2 { 173_M732 J 
msa72034 . 2 { 173_M781 } 
msa72034 . 2 { 173_1169NT} 
msa72034 . 2 {l73_H36B} 
msa72034 . 2 { 173_JM9130013 } 
Consensus 



msa72034 . 2 { 173_090 } 
msa72034 . 2 { 173_18RS21 } 
msa72034 . 2 ( 173_2603 } 
msa72034 . 2 { 173_A909 J 
msa72034 . 2 { 173_CJB110> 
Tnsa7 2 0 3 4 . 2 { 17 3_C0H1 } 
msa72034 . 2 { 173_M732 [ 
msa72034.2{173_M781 
msa72034 . 2 { 173_1169NT] 
msa72034 . 2 { 173_H36B] 
msa72034 . 2 {173_JM9130013 , 
Consensus 



_**★*★*** ****^***** ***★*★***- 



*** ***** 

101 

VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
VTSEELVNMA YDIIAKENPS LNAVITTRRQ 
********** ********** ********** 

151 

LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAdGKI STFD 
LLVKGLGHSI KGGETNNGL I YAgGKI STFD 
LLVKGLGHSI KGGETNNGL I YAgGKI STFD 
********** ********** **-******* 



msa72034 
msa72034.2{ 

msa72034 

msa72034 . 
msa72034.2{ 

msa72034. 

msa72034 . 

msa72034 . 
msa72034 .2{ 

msa72034 



,2{173_090> 
173_18RS21} 
2{173_2603> 
2{173_A909} 
173_CJB110> 
2{173_C0H1) 
2{173_M732} 
2{173_M781} 
173_1169NT) 
2{173_H36B} 



EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
EAIEEARKLK 
********** 



SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
SSYVKKYKDL 
********** 



201 

PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 
PEYGWRNITD 



SKLYGl THNP 
S KLYG1THNP 
S KL YGlTHNP 
SKLYG1THNP 
SKLYG1THNP 
SKLYGpTHNP 
SKLYGxTHNP 
SKLYGpTHNP 
SKLYGpTHNP 
SKLYGpTHNP 
SKLYGpTHNP 



********** *****_**** 



wdLAHNAGGS 
wdLAHNAGGS 
wdLAHNAGGS 
wdLAHNAGGS 
wdLAHNAGGS 
wnLAHNAGGS 
wdLAHNAGGS 
wnLAHNAGGS 
mLAHNAGGS 
wnLAHNAGGS 
wnLAHNAGGS 
******** 



251 

AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
AGGSIRIPSS 
********** 

301 

LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
LTYLKKSDQT 
********** 

351 

kQGFKVTEID 
kQGFKVTE ID 
kQGFKVTEID 
kQGFKVTEID 
kQGFKVTEID 
kQGFKVTEID 
kQGFKVTEID 
eQGFKVTEID 
kQGFKVTEID 
kQGFKVTEID 



WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
WTGLVGLKPT 
********** 



LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
LVSVNDLKSL 
********** 



LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 
LPIDGRALMR 



RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVShEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
RGLVSnEKPD 
*****_**** 



PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
PIAYTLKSPM 
********** 



SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAalAS 
SGGSAAvIAS 
SGGSAAvIAS 
******_*** 



SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
SYSTAVHFPL 
********** 



GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
GTEVSQDAKN 
********** 



DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 
DYSTLAIGMG 



GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 
GAFSTIEKDL 



_********* 
150 

DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
DTNQPFLGVP 
********** 

200 

GFIILGQTNF 
GFIILGQTNF 
GFIILGQTNF 
GFI ILGQTNF 
GFI ILGQTNF 
GFIILGQTNF 
GFI ILGQTNF 
GFIILGQTNF 
GFIILGQTNF 
GFIILGQTNF 
GFIILGQTNF 
********** 

250 

GMTP I ASGSD 
GMTPIASGSD 
GMTP I ASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
GMTPIASGSD 
********** 

300 

TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
TKSSRDAETL 
********** 

350 

AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNVtFLR 
AIMDNViFLR 
AIMDNViFLR 
******_*** 

400 

KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
KKHGFTKEDV 
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Table 48: Comparative Sequences relating to SAG1474 



msa72034 . 2 { 173_JM9130013 } 
Consensus 



kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHG FTKEDV 
_********* ********** ********** ********** ********** 



msa72034 
msa72034.2{ 

msa72034 

msa72034 
msa72034.2{ 

msa72034 . 

msa72034 . 

msa72034 . 
msa72034 . 2 { 

msa72034 . 
msa72034.2{l73 



2{l73_090> 
173_18RS2l} 
2{l73_2603} 
2{173_A909} 
173_CJB110> 
2{l73_COHl) 
2{173_M732} 
2{l73_M78l} 
173_1169NT} 
2{173_H3 6B} 
JM9130013} 
Consensus 



401 

DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWaVHVI 
DPITWgVHVI 



YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 
YQNSDKAELK 



KSImEAQKHM 
KSImEAQKHM 
KSImEAQKHM 
KSImEAQKHM 
KSImEAQKHM 
KSIvEAQKHM 
KSIvEAQKHM 
KSIvEAQKHM 
KSImEAQKHM 
KSImEAQKHM 
KSImEAQKHM 



*****_**** ********** ***_****** 



DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
DDYRKAMEKL 
********** 



450 

HKQFPIPLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
HKQFPIFLSP 
********** 



msa72034 
msa72034 . 2{ 

msa72034 . 

msa72034. 
msa72034.2{ 

msa72034. 

msa72034 . 

msa72034 . 
msa72034.2{ 

msa72034 . 
msa72034.2{l73 



2{l73_090} 
173_18RS2ll 
2(l73_2603} 
2{173_A909} 
173_CJB110} 
2fl73_COHl) 
2{173_M732} 
2{173_M781} 
173_1169NT} 
2{173_H36B} 
JM9130013} 
Consensus 



451 

TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
TTASLAPLNT 
********** 



DPYVTEeDKR 
DPYVTEeDKR 
DPYVTEeDKR 
DPYVTEeDKR 
DPYVTEeDKR 
DPYVTEkDKR 
DPYVTEkDKR 
DPYVTEkDKR 
DPYVTEeDKR 
DPYVTEeDKR 
DPYVTEeDKR 
******_*** 



AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 
AIYNMENLSQ 



EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 
EERIALFNRQ 



********** ********** 



500 

WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
WEPMLRRTPF 
********** 



msa72034 
msa72034.2{ 

msa72034 

msa72034 . 
msa72034 .2{ 

msa72034 . 

msa72 034 . 

msa72034 . 
msa72034.2{ 

msa72034 . 
msa72034 .2(173. 



2{173_090} 
173_18RS2l} 
2{173_2603} 
2{173_A909 
173J2JB110, 
2{l73_COHl' 
2{173_M732' 
2{l73_M78l] 
173_1169NT} 
2{173_H36B} 
_JM9130013} 
Consensus 



501 

TqlANMTGLP 
TqlANMTGLP 
TqlANMTGLP 
TqlANMTGLP 
TqlANMTGLP 
TpIANMTGLP 
Tp I ANMTGLP 
TpIANMTGLP 
TqlANMTGLP 
TqlANMTGLP 
TqlANMTGLP 
*„******** 



AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
AISIPTYLSE 
********** 



SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
SGLPIGTMLM 
********** 



AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
AGANYDMVLI 
********** 



550 

KFATFFEKhH 
KFAT FFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFATFFEKhH 
KFAT FFEKyH 
KFATFFEKyH 
********_* 



msa72034 
msa72034.2{ 
msa72034 
msa72034 
tnsa72034.2{ 
msa72034 
msa72034 
msa72034 . 
msa72034.2{ 
msa72034 . 
msa72034.2{l73 



.2{173_090] 
173_18RS21 
2{l73_2603 i 
2{173_A909; 
173_CJB110 1 
2{l73_COHl] 
2{173_M732, 
2{l73_M78l] 
173__1169NT] 
2{173_H36B] 
;_JM9130013} 
Consensus 



551 

GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 
GFNVKWQRI I 



DKEVKPStgL 
DKEVKPStgL 
DKEVKPStgL 
DKEVKPStgL 
DKEVKPStgL 
DKEVKPSadL 
DKEVKPSadL 
DKEVKPSadL 
DKEVKPStgL 
DKEVKPStgL 
DKEVKPStgL 



********** *******_ 



IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
IQPTNSLFKA 
********** 



HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
HSSLVNLEEN 
********** 



600 

SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
SQVTQVSISK 
********** 



msa72034 .2{173_090} 
msa72034 . 2 { 173_18RS21 } 
msa72034. 2 f 173^2603} 
msa72034 . 2 {173_A909} 
msa72034 - 2 { 173_CJB110} 
msa72034 . 2 { 173_C0H1 J 
msa72034..2|l73__M732} 
msa72034 . 2 {173_M781} 
msa72034 . 2 { 173_1169NT} 
msa72034.2{l73_H36B) 
msa72034.2{l73_JM9130013} 
Consensus 



601 619 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 
KWMKSSVKNK psvmayqka 

KWMKSSVKNK 

KWMKSSVKNK psvmay 

********** 
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SEQ ID NO: 4901 
STRAIN 2603 

aaacatccgatacttaatgatcaaaaatccttagcaattgttgaacagat 
agaatatgattttgataaattcgataattcagaagcttctttttatgcaa 
cattagctagawttcgcgttatggatagagaaatcaaaaaatttattaga 
gaaaatccaaatagtcaaatcctttcaattggttgtggacttgatacaag 
gtttgaaagagtcgataatggacaaattaggtggtataaccttgatttgc 
cagagg 1 1 a t ggaga taagaaaattatttttt gaagagca tgaa agag 1 1 
actaatatagcaaaatcagccctagatgaaacttggacacgggaggtaaa 
tccccaaaatgccccttttctaatcgtgtcagaaggtgttttaatgtttc 
taaaagaagatgacgtagagacttttcttcatatcctgacaaattcattt 
agccaatttatggcacaatttgatttgtgtcataaggaaatgattaataa 
aggaaagcaacatgatacagtaaagtatatggatacagaatttcagtttg 
gt at cacaga tggt cat gagat t g tggat 1 1 agaccc t aaa 1 1 aaagcaa 
ataaatctgattaactttacagatgagatgagcaaatttgagttaggcac 
acttcgctctttacttccaacaattcgtaaatttaataattgtttaggtg 
tgtacgaatataaagcatc 

SEQ ID NO: 4902 
STRAIN 090 

TAATGATCAAAAAT C CTTAG CAATTGTTG AACAGATAGAATATGATTTTG 
ATAAATT CGATAATT CAGAAGCTT C"l" l'TTTATGCAACATTAG CTAGAATT 
CG CGTTATGGATAGAGAAATCAAAAAATTTATTAGAGAAAATCCAAATAG 
TCAAATCCn^CAATTGGTTGTGGACTTGATACAAGGTTTGAAAGAGTCG 
ATAATGGACAAATTAGGTGGTATAACCTTGATTTG C CAGAgGTT ATGGAG 
ATAAGAAAATTATTTTTTGAAGAG CATGAAAG AGTTACTAATAT AG CAAA 
AT CAG CCATAGATGAAACTTGGACACGGG AGGTAAAT CC C CAAAATGCCC 
CTTTT CT AAT CGTGTCAGAAGGTGTTTTAATGTTT CTAAAAGAAGATGAC 
GTAG AGACTTTTCTTCATAT CCTGACAAATT CATTTAGC CAATTTATGGC 
ACAATTTGATTTGTGT CATAAGGAAATGATTAATAAAGG AAAG CAACATG 
ATACAGTAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGATGGT 
CATGAGATTGTGGATTTAGACCCTAAATTAAAGCAAATAAATCT 
CTTTACAGATGAGATGAG CAAATTTGAGTTAGG CACACTT CX3CT CTTTAC 
TTCCAACAATTCGTAAATTTAATAATTGTTTAGGTGTGTACGAATATAAA 
GCATC 

SEQ ID NO: 4903 
STRAIN A909 

AAACATCCGATACTTAATGA 

TCAAAAATCCTTAG CAATTGTTGAACAGATAGAATATGATTTTGATAAAT 
TCGATAATTCAGAAGCTTCITTTTATGCAACATTAGCTAGAATT 
ATGGATAGAGAAAT CAAAAAATTTATTAGAGAAAAT C CAAATAGT CAAAT 
CcTTT CaATTGGTTGTGGACTTGATACAAGGTTTGAAAGAGT CGATAATG 
GACAAATTAGGTGGTATAAC CTTG ATTTG C CAGAGGTTATGG AG ATAAGA 
AAATTaTTTTTTGAAGAG CATG AAAGAGTTACTAATATAG CAAAATCAG C 
CCTAGATGaAACITGGACACGGGAGGTAAATCCCCAAAATGCCCCTTTTC 
TAATCGTGT CAG AAGGTGTTTTAATGTT t CTAAAAGAAGATGACGTAGAG 
ACTTTT CTTCATATCCTGACAAATT CATTTAGCCAATTTATGGCACAATT 
TGATTTGTGT CATAAGGAAATGATTAATAAAGGAAAG CAACATGATACAG 
TAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGATGGTCATGAG 
ATTGTGGATTTAGAC CCTAAATTAAAG CAAAT AAATCTGATTAACTTTAC 
AGATGAGATGAG CAAATTTG AGTTAGG CACACTTCGCT LUTTACTTC CAA 
CAATT CGTAAATTTAATAATTGTTTAGGTGTGTACGAATATAAAG CATC 

SEQ ID NO: 4904 
STRAIN H36B 

AAACATCCGATACTTAATCATCAAAAATCCTTAGCA 

ATTGTTGAACAGATAGAAT ATGATTTTGAT AAATTCGATAATT CAGAAG C 
TTCTTTTTATGCAaCATTAGCTAGAATTCGCGTTATGGATAGAGAAAT CA 
AAAAATTTATTAGAGAAAATCCAAATAGTCATATCU'iU'rCAATT GGCT 
GgACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTA 
TAACCTTGATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAG 
AGCATGAAAGAGTTACTAATATAG CAAAATCAGCCcTAGATGAAACTTGG 
ACACGGGAGGTAAATCCCCAAAATGCCCCTITTTCrAATCGTGTCAa^AGG 
TGTTTTAATGTTTCTAAAAG AAGATGACGTAGAGA(J-l"lTT CTTCATATC C 
TGACAAATT CATTTAGCCAA.TTTATGGCACAATTTGATTTGTGT CAgAAG 
GAAATGATTAATAAAGG AAAGCAA CATGATACAGTAAAGTATATGGATAC 
AGAATTTCAGTTGGGTATCACAGATGGTCATGAAATTGTGGATTTAGACC 
CTAAATTAAAGCAAATAAATCTGATTAACTTTAC^ 
TTTGAGTTAGGCACACTTCGCTCTTTACTTCCAACAAT^ 
TAATTGTTTAGGTGTGTACGAATATAAAGCATC 

SEQ ID NO: 4905 
STRAIN 18RS21 

AACATCCGATACTTAATGATCAAAAATCCTTAGCAAT 
TGTTGAACAGAT AGAATATG ATTTTGATAAATT CGAT AATTCAGAAG CTT 
CTITTTTATG CAACATTAG CTAGAATTCGCGTTATGGATAGAGAAATCAAA 
AAATTTATTAGAGAAAAT CCAAATAGT CaAAT CCTTT CAATTGGTTGTGG 
ACTTGATACAAGGTTTGAA^GAGTCGATAATGGACAAATTAGGTGGTATA 
AC CTTGATTTG C CAG AG GTTATGGAGAT AAGAAAATT ATTTTTTG AAGAG 
CATGAAAGAGTTACT AATATAG CAAAATCAGC CCTAG ATG AAACTTGGAC 
ACGGG AGGTAAATCCCCAAAATGCCCCTTTTCT AATCG TGTCAgAAGGTG 
TTTTAATGTTTCTAAAAGAAGATGACGTAGAG ACTTTT CTT CATATC CTG 



830 



WO 2004/018646 



PCT/US2003/026827 



Table 49: Comparative Sequences related to SAG1502 



ACAAATTCATTTAGCCAATTTATGGC^CaATTTGATTTGTGTCATAaGGA 
AATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACAG 
AATTT CAGTTTGGT AT CACAGATGGTCATG AGATTGTGG ATTT AGAC C CT 
AAATTAAAGCAAATAAATCTGATTAACTTTACAGATGAGATGAGCAAATT 
TG AGTTAGG CACACTT CG CT CTTTACTTC CAACAATTCGT AAAT TTAAT A 
ATTGTTTAGGTGTGT ACGAA t ATA a aG CATC 

SEQ ID NO: 4906 
STRAIN M732 

AAACATCCX3ATACTTAATGATCAAAAATCCTTAG CAATTGTTGAACA 

GATAG AATAT GATTTGGAT AAATT CGATAATT CAGAAG CTT CTTTTTAT G 

(TAACATTAGCTAGAATTCGCGTTATGGATAGAGAAATCAAAAAATTTATT 

AGAGAAAATCCAAATAGTCAAATCCTTTCAATTGGTTGTGGACTTGATAC 

AAGGTTTGAAA.GAGTCGATAATGGACAAATTAGGTGGTATAACCTTGATT 

TGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAGAGCATGAAAGA 

GTTACTAAT ATAG CAAAAT CAG CC CTAGATGAAACTTGGACACGGGAGGT 

AAATCCCCAAAATGCCCCTTTTCTAATCGTGTCAGAAGGTGTTTTAATGT 

TTCTAAAAgAAGATGACGTAGAGACTTTTCTTCAtATCCTGACT^ 

TTTAGCCAATTTATGGCaCAATTrGATTTGTGTCATAAGGAAATGATTAA 

TAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACAGAATTTCAGT 

TTGGTATCACAGATGGTCATGAGATTGTGGATTTAGACCCTAAATTAAAG 

CAAAT AAAT CTGATTAACTTTACAGATGAGATGAG CAAATTTGAGTTAgG 

CACACITCGCTCTTTACT^ 

G t GTGTACGAAT ATAAAG CATC 

SEQ ID NO s 4907 
STRAIN COH1 

AAACATCCGATACTTAATGATCAAAAATCCTTAGCAA 
TTGTTGAACAGATAGAATATGATTTGGATAAATT CGATAATT CAGAAG CT 
TC w ri w riT ATG CAACATTAGCrrAGAATT CGCGTTATGGAT AGAGAAAT CAA 
AAAATTTATT AGAGAAAATCCAAATAGT CAAAT C CTTTCAATTGGTTGTG 
GACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTAT 
AACCIT'GATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTG 
GCATGAAAGAGTTACTAATATAGCAAAAT CAG CCCTAGATGAAACTTGG A 
CACGGGAGGTAAAT CCC CAAAATG CCCCTTTTCTAATCGTGT CAG AAGGT 
GTTTTAATGTTT CTAAAAGAAGATGACGT AGAGACITTTCTT CAT AT CCT 
GACAAATT CATTTAGCCAATTT ATGGCACAATTTGATTTGTGT CATAAGG 
AAATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACA 
GAATTTCAGTTTGGTATCACAGATGGT CATGAGATTGTGGATTTAGAC CC 
TAAATTAAAGCAAATAAATCTGATTAACTTTACA.GATGAGATGAGCAAAT 
ITGAGTTAGGCACACTTCGCTCTTTACTTCCAACAATTC 
AATTGTTTAGGTGTGTACG AATATAAAGCAT C 

SEQ ID NO: 4908 
STRAIN M7 81 

AAACATCCGATACTTAATGATCA 

AAAATCCTTAGCAATTGTTGAACAGATAGAATATGATTTGGATAA^TTCG 
ATAATTCAGAAG CTT CTTTTTATG CAACATTAGCT AG AATTCGCGTTATG 
GATAGAGAAATCAAZUW^TTTATTAGAGAAAAT CCAAATAGT CAAAT CCT 
TTO^TTGGTTGTGGACTTGATACAAGGTTTGAAAGAGTCGATAA 
AAATTAGGTGGTATAACCTTGATTTGCCAGAGGTTATGGAGATAAGAAAA 
TTATTTTTTGAAGAG CATG AAAGAGTTACTAATATAGCAAAATCAGC C CT 
AGATGAAACTTGGACACGGGAGGTAAATC C CCAAAATGC C CCTTTT CTAA 
TCGTGTCAGAAGGTGTTTTAATGTTTCTAAAAgAAGATGACGTAGAGACT 
TTT CTTCATATCCTGACAAAT t GATTTAGCCAATTTAtGGCACAATTTGA 
TTTGTGT CATAAGGAAATGATTAATAAAGG AAAG CAACATGATACAGTAA 
AGTATATGGATACAGAATTT CAGTTTGGTAT CACAGATGGT CATGAGATT 
GTGGATTTAg AC CCTAAATTAAAGCAAATAAAT CTGATTAACTTTACAG A 
TGAGATGAGCAAATTTGAGTTAGG CACA.CTTCGCT CTTTACTT C CA^CAA 
TTCGTAAATTTAATAAT tGTTTAGGTGTGTACGAATATAAAG CATC 

SEQ ID NO: 4909 
STRAIN CJB110 

AAACATCCGATACTTAATGATCAAAAA.TCCrTAGCAA 
TTGTTGAACAGATAGAATATGATTTTGATAAATT CGATAATT CAGAAGCT 
TCTTTTTATG CAACATT AGCTAGAATT CG CGTTATGGATAGAGAAAT CAA 
AAAATTTATT AGAGAAAATC CAAATAGTCAAATC CIT^ 
GACITGATACAAG^TTTGAAAGAGTCGATAATGGACAAATTAGGTGGTAT 
AACCITGATTTG CCAGAGGTTATGG AGATAAGAAAATTATTTT^ 
GCATGAAAGAGTTACTAATATAGCAAAAT CAG CCATAGATGAAACTTGG A 
CACGGGAGGTAAAT CCC CAAAATGC CC CITTTCTAAT CGTGT CAGAAGGT 
GTTTTAATGTTT CTAAAAGAAGATG ACGTAGAGACITTTCTT CAT AT CCT 
' GACAAATTCATTT AGCCAATTTATGG CACAATTT GATTTGTGT CATAAG G 
' AAATGATTAATAAAGGAAAG CAACATGATACAGTAAAGT ATATGG ATACA 
GAATTT CAGTTTGGT ATCAGRGATGGTC^ 

TAAATTAAAG CAAAT AAAT CTTG ATT AACTTTACAG ATGAG AT GAG CAAAT 
TTGAGTTAGG CACACTTCG CTCTTTACTTC CAACAATC 
AATTGTTTAGGTGTGTACGAAT ATAAAG CATC 

SEQ ID NO: 4910 
STRAIN 116 9NT 

AAACATCCGATACTT AATGAT CAAAAATCCTTAG CAAT 
TGTTGAACAGATAGAATATGATTITGATAAATTCGATAATTCAGAAGCTT 



831 



WO 2004/018646 



PCT/US2003/026827 



Table 49: Comparative Sequences related to SAG1502 



CTTTTTATG CAACAT TAG CTAG AATTCGC GTTATGGAT AGAGAAATC AAA 
AAATTTATT AG AGAAAAT C CAAATAGT CATATC CTTT CTATTGGTTGTG G 
ACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTATA 
ACCTTGATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAGAG 
CATGAAAGAGTTACTAATATAGCAAAATCAGCCCTAGATGAAACTTGGAC 
ACAGGAGGTAAATCCCCAAAATGCCCCTTTTCTGATCGTGTCAGAAGGTG 
TTTTAATGTTTCTAAAAGAAGATGACGTAGAGACTTTTcTTCATATCCTG 
ACAAATTCATTTAGCGAATTTATGGCACAATTTGATTTGTGt CAGAAGGA 
AATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACAG 
AATTTCAGTTTGGTATCACAGATGGTCATGAAATTGTGGATTTAGACCCT 
AAATTAAAGCAAATAAATCTGATTAACITTACAGATGAGATGAGCAAATT 
TG AGT TAGG CACACTTCG C T CTTTACTTC CAACAATTCGTAAATTTAAT A 
ATTGTTTAGGTGTGTACGAATATAAAGCATC 

SEQ ID NO: 4911 
STRAIN JM913 0013 

AGCAATTGTTGAACAGATAGAATATGATT 

TTGATAAATTCX3ATAATTCAGAAGCTTCTTTTTATGCAACATTAGCTAGA 

ATTCGCGTTATGGATAGAGAAATCAAAAAATTTATTAGAGAAAATCCAAA 

TAGTCA.TATCCTTTCAATTGGCTGTGGACTTGATACAAGGTTTGAAAGAG 

T CGATAATGGACAAATTAGGTGGT ATAAC CTTGATTTG CCAGAGGTTATG 

GAGATAAGAAAATTATTTTTTGAAGAG CATGAAAG AGTTACT AATATAG C 

AAAAT CAGCC CTAG ATG AAACTTGGACACGGG AGGTAAAT CCC CAAAATG 

CCCCTTTTCTAATCGTGTCAGAAGGTGTTTTAATGTTTCTAAAAGAAGAT 

GACXJTAGAGACTTCTCTTCATATCCTGAGAAATTCATTTAGCCAATTTAT 

GGCACAATCTGACTTGTGTCAgAAGGAAATGATTAATAAAGGAAAGCAAC 

ATGATAC^GTAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGAT 

GGTCATG AAATTGTGGATTT AG AC CCTAAATTAAAG CAAATAAATCTGAT 

TAACTTTACAGATGAGATGAGCAAATTTGAGTTAGGCACACT^ 

TACCTCCAACAACTCGTAAATTTAATAATTGTTTAGGTGTGTACGAATAT 

AAAGCATC 



PRETTY of: /biotmp/msa42193 . 2 { *} January 21, 2003 05:04 



msa42193 .2{l76_090} 
msa42193.2{l76_CJB110} 
msa42193 . 2 { 176_18RS21 } 
msa42193.2{l76_2603} 
msa42193 . 2 { 176_A909 } 
msa42193 . 2{l76_COHlJ 
msa42193.2{!76_M732} 
msa42193 . 2 {l76_M78l} 
msa42 193 . 2 { 176_H3 6B } 
msa42193 . 2 { 176_JM9130013 } 
msa42193 . 2 { 176_1169NT} 
Consensus 



AAACATCCGA 
-AACATCCGA 
AAACATCCGA 
AAACATCCGA 
AAACATCCGA 
AAACATCCGA 
AAACATCCGA 
AAACATCCGA 



taatga 

TACTtaatga 
TACTtaatga 
TACTtaatga 
TACTtaatga 
TACTtaatga 
TACTtaatga' 
TACTtaatga 
TACTtaatga 



tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 
tcaaaaatcc 



AAACATCCGA TACTtaatga 
********** **** 



tcaaaaatcc 



1 1 AGCAATTG 
1 t AGCAATTG 
tt AGCAATTG 
1 1 AGCAATTG 
1 1 AGCAATTG 
1 1 AGCAATTG 
1 1 AGCAATTG 
1 1 AGCAATTG 
1 1 AGCAATTG 
— AGCAATTG 
1 1 AGCAATTG 
— ******** 



50 

TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
TTGAACAGAT 
********** 



, 51 100 

msa42193.2{176_090} AGAATATGAT TT t GATAAAT TCGATAATTC AGAAG CTT CT TTTTATGCAA 

msa42193 .2{176_CJB110} AGAATATGAT TT t GATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193 .2{l76_18RS2l} AGAATATGAT TT t GATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193 . 2 {176_2603 ) AGAATATGAT TT t GATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193 . 2 {176_A909} AGAATATGAT TT t GATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{!76_COHl} AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{l76_M732} AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{l76_M78l} AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{l76_H36B} AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{l76_JM9130013} AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

msa42193.2{l76_1169NT} AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA 

Consensus ********** **„******* ********** ********** ********** 



101 150 

msa42193.2{176_090} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

rasa42193.2{l76_CJB110} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{l76_18RS2l} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{176_2603} CATTAGCTAG AwTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

tnsa42193.2fl76_A909} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{176_COHl) CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193 . 2 { 176_M732 } CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{l76_M78l} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{l76_H36B) CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{l76_JM9130013} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

msa42193.2{l76_1169NT} CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA 

Consensus ********** *_******** ********** ********** ********** 



msa42193 .2{l76_090} 
msa42193 . 2 { 176__CJB110 } 
msa42193 . 2{176_18RS2l} 
msa42193 . 2 {176_2603 } 
msa42193 . 2 { 176_A909> 
msa42193.2{l76 COHl} 



151 

GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 
GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 
GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 
GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 
GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 
GAAAATCCAA ATAGTCAaAT CCTTTCaATT GGtTGTGGAC 



200 

TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 



832 



WO 2004/018646 



PCT/US2003/026827 



Table 49: Comparative Sequences related to SAG1502 



msa42193 . 2 ( 176_M732 } 
tnsa42193 . 2 { 176JVI781 } 
msa42*193 .2{176_H36B} 
msa42193 . 2 {l76_JM9130013 } 
msa42193 . 2 { 176JL169NT} 
Consensus 



GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 



ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAtAT 
ATAGTCAtAT 
ATAGTCAtAT 



********** *******_** 



CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCtATT 
******_*** 



GGtTGTGGAC 
GGtTGTGGAC 
GG cTGTGGAC 
GG cTGTGGAC 
GGtTGTGGAC 
** _******* 



TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
********** 



msa42193 
msa42193.2{ 
msa42193 .2{ 

msa42193 . 

rasa42193 . 

msa42193 . 

msa42193. 

msa42193 . 

msa42193 . 
msa42193.2{l76 
msa42193.2{ 



.2{176_090} 
176_CJB110} 
176_18RS2l} 
2fl76_2603} 
2il76_A909} 
2{176_COHl} 
2{176_M732; 
2(l76_M78i; 
2{176_H36Bi 
JM9130013' 
176_JL169NT] 
Consensus 



201 

GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 
GTTTGAAAGA 



GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 
GTCGATAATG 



********** ********** 



GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
GACAAATTAG 
********** 



GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
GTGGTATAAC 
********** 



250 

CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
CTTGATTTGC 
********** 



251 300 

msa42193 .2{176_090} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193.2{l76_CJB110} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193 . 2 { 176__18RS21 } CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193.2{!76_2603} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193.2{l76__A909j CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

maa42193 .2 {l76_COHl} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193 . 2 { 176_M732 } CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193 ,2{l76_M78l} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193.2{l76_H36B} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa42193.2{l76_JM9130013} CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

msa4 2193.2(17 6_1 1 6 9 NT } CAGAGGTTAT GGAGATAAGA AAATTATTTT TTGAAGAGCA TGAAAGAGTT 

Consensus ********** ********** ********** ********** ********** 



301 350 

msa421 93. 2 { 176^090} ACTAATATAG CAAAATCAGC CaTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_CJB110} ACTAATATAG CAAAATCAGC CaTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_18RS2l) ACTAATATAG CAAAATCAGC C c TAG ATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_2603} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2f 176_A909} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

tnsa42193.2{176_COHl} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_M732} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{176_M78l} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76__H36B} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_JM9130013} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC gGGAGGTAAA 

msa42193.2{l76_1169NT} ACTAATATAG CAAAATCAGC CcTAGATGAA ACTTGGACAC aGGAGGTAAA 

Consensus ********** ********** *_******** ********** _********* 



msa42193 
msa42193 .2 
msa42193 .2 

msa42193. 

tnsa42193 . 

msa42193 . 

msa42193 . 

rasa42193 . 

msa42193 . 
msa42193.2{l76 
msa42193.2{' 



,2{176_090 J 
176_CJB110} 
176_18RS21} 
2{176_2603} 
2{176_A909} 
2{l76_COHl} 
2f 176J4732} 
2{176_M781} 
2{176_H36B} 
JM9130013} 
176_1169NT} 
Consensus 



351 

TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
TCCCCAAAAT 
********** 



GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
********** 



TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TgATCGTGTC 
*_******** 



AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
********** 



400 

TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
********** 



401 450 

msa42193 . 2{l76_090} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC AT AT C CTGAC AAATTCATTT 

msa42193 . 2 { 176_CJB110 } TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATAT C CTGAC AAATTCATTT 

msa42193 .2{l76_18RS2l} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATAT C CTGAC AAATTCATTT 

msa42193.2{!76_2603} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193 .2{176_A909} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193 .2{l76_COHl} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193 .2{176_M732} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193 .2{l76_M78l} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193 .2{176_H36B> TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193.2{l76_JM9130013l TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

msa42193.2{l76_1169NT} TAAAAGAAGA TGACGTAGAG ACTTTTCTTC ATATCCTGAC AAATTCATTT 

Consensus ********** ********** ********** ********** ********** 



msa42193 .2{176_090} 
msa42193.2{l76_CJB110} 
msa42193 . 2 { 176_18RS21 } 
msa42193 . 2 { 176_2603 ) 
rasa42 193 . 2 { 176_A909 } 



451 

AG CCAATTTA 
AGCCAATTTA 
AG CCAATTTA 
AGCCAATTTA 
AGCCAATTTA 



TGGCACAATT TGATTTGTGT CAtAAGGAAA 

TGGCACAATT TGATTTGTGT CAtAAGGAAA 

TGGCACAATT TGATTTGTGT CAtAAGGAAA 

TGGCACAATT TGATTTGTGT CAtAAGGAAA 

TGGCACAATT TGATTTGTGT CAtAAGGAAA 



500 

TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 



833 



WO 2004/018646 



PCT/US2003/026827 



Table 49: Comparative Sequences related to SAG1502 



msa42 193 . 2 { 176_COHl 1 AGCCAATTTA TGGCACAATT TGATTTGTGT CAtAAGGAAA TGATTAATAA 

msa42193 .2{l76_M732) AGCCAATTTA TGGCACAATT TGATTTGTGT CAtAAGGAAA TGATTAATAA 

msa42193.2{l76_M78l} AGCCAATTTA TGGCACAATT TGATTTGTGT CAtAAGGAAA TGATTAATAA 

msa42193.2{l76_H36B} AGCCAATTTA TGGCACAATT TGATTTGTGT CAgAAGGAAA TGATTAATAA 

msa42193 . 2 { 176_JM913 0013 } AGCCAATTTA TGGCACAATT TGATTTGTGT CAgAAGGAAA TGATTAATAA 

msa42193.2{l76_1169NT} AGCCAATTTA TGGCACAATT TGATTTGTGT CAgAAGGAAA TGATTAATAA 

Consensus ********** ********** ********** **_******* ********** 



msa42193 
msa42193 .2 
msa42193 .2 

msa42193 . 

rasa42193 . 

msa42193 . 

msa42193 . 

msa42193 . 

msa42193 . 
msa42193.2(l76 
msa42193.2{' 



.2{176_090} 
176J2JB110} 
176_18RS2l} 
2(l76_2603) 
2(176__A909} 
2{l76_COHl} 
2fl76_M732} 
2{176_M78l} 
2{176_H36B} 
_JM9130013} 
176_JL169NT} 
Consensus 



501 

AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
********** 



CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
********** 



TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
********** 



GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
********** 



550 

TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTgG 
TTTCAGTTtG 
TTTCAGTTtG 
********_* 



551 600 

msa4 2193.2(17 6_0 90} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 . 2{l76_CJB110 } GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

rasa42193 . 2{l76_18RS2l} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

rasa42193 .2{l76_2603} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 .2{17S_A909} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 - 2 {l76_COHl} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 .2{176_M732} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 .2 {176_M781} GTATCACAGA TGGTCATGAg ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 .2 (176_H36B) GTATCACAGA TGGTCATGAa ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193.2{l76_JM913 0013} GTATCACAGA TGGTCATGAa ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

msa42193 .2{176_1169NT} GTATCACAGA TGGTCATGAa ATTGTGGATT TAGACCCTAA ATTAAAGCAA 

Consensus ********** *********-. ********** ********** ********** 



rasa42193 
msa42193 .2 
msa42193 .2 

msa42193 . 

msa42193 . 

msa42193 . 

rasa42193 . 

msa42193 . 

msa4 2193 . 
msa42193.2{l76 
msa42193.2{ 



.2{l76_090} 
176_CJB110} 
176_18RS2l} 
2(176^2603} 
2(176_A909} 
2{l76__COHl} 
2{176_M732} 
2(176_M781} 
2{176_H36B} 
JM9130013) 
176_1169NT} 
Consensus 



601 

ATAAAT CTGA 
ATAAATCTGA 
ATAAAT CTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
ATAAATCTGA 
********** 



TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
TTAACTTTAC 
********** 



AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
AGATGAGATG 
********** 



AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
AGCAAATTTG 
********* * 



650 

AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
AGTTAGGCAC 
********** 



651 700 

msa42193 . 2{l76_090} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193 . 2 { 176_CJB110} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193.2{l76_18RS2l} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193 .2f 176_2603> ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193 .2{176_A909} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

maa42193 .2{l76_COHl} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

tnsa42193.2(l76_M732} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193.2fl76_M,78l} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193 . 2 { 176__H36B} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193.2{l76_JM9130013} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

msa42193 . 2 { 176_1169NT} ACTTCGCTCT TTACTTCCAA CAATTCGTAA ATTTAATAAT TGTTTAGGTG 

Consensus ********** ********** ********** ********** ********** 



msa4 2 1 93 . 2 ( 176_0 90 
msa42193 . 2 {l76_CJB110 
msa42193 . 2 { 176_JL8RS21 
msa42193.2(l76_2603 
msa42193 . 2 (176_A909 
msa42193.2{176_COHl 
msa42193.2(l76_M732 
msa4 2 1 9 3 . 2 { 17 6_M7 8 1 
msa42193.2(176_H36B 
msa42193.2(l76_JM9130013} 
msa42193 . 2 ( 176_1169NT} 
Consensus 



701 

TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
TGTACGAATA 
********** 



719 

TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
TAAAGCATC 
********* 



SEQ ID NO: 4912 

STRAIN 2603 frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYAT^ 

GCGLDTRFERVDNGQ IRWYNLDLPEVME I RKLFFEEHERVTN IAKSALDETWTREVNPQN 
APFLIVSEGVLWFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE 
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Table 49: Comparative Sequences related to SAG1502 



FQFG ITDGHE I VDLDPKLKQ INL INFTDEMSKFELGTLRSLLPTI RKFNNCLGVYEYKA 

SEQ ID NO: 4913 

STRAIN 090 frame: 2 

NDQKS LA IVEQ I E YD FD KFDNSEAS FYATLARI RVMDRE I KKF I RENPNS Q I LS I GCGLD 
TRFERVDNGQI RWYNLDLPE VME I RKLFFEEHERVTNIAKSAIDETWTREVNPQNAPFL I 
VSEGVLMFLKEDDVETFLH I LTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTEFQFG I 
TDGHE I VDLDPKLKQ INLI NFTDEMSKFELGTLRSLLPTI RKFNNCLGVYE YKA 

SEQ ID NO: 4914 

STRAIN A9 0 9 frame: 1 

KHP I LNDQKSLAI VEQI E YDFDKFDNSEAS FYATLARI RVMDRE I KKFI RENPNSQI LS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSALDETWTREVNPQN 
APFLIVSEGVIjMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE 
FQFG I TDGHE I VDLDPKLKQ INL I NFTDEMSKFELGTLRSLLPTI RKFNNCLGVYEYKA 

SEQ ID NO: 4915 

STRAIN H36B frame: 1 

KHPI LNDQKSLAI VEQI EYDFDKFDNSEAS FYATLARI RVMDREI KKFI RENPNSHI LS I 
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN 
APFL I VSEGVLMFLKEDDVETFLHI LTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTE 
FQLG ITDGHE I VDLDPKLKQ INL I NFTDEMSKFELGTLRSLLPTI RKFNNCLGVYEYKA 



SEQ ID NO: 4916 

STRAIN 18RS21 frame: 3 

HP I LNDQKSLAI VEQI EYDFDKFDNSEAS FYATLARI RVMDRE I KKFI RENPNSQI LS I G 
CGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSALDETWTREVNPQNA 
PFL I VSEGVLMFLKEDDVETFLH I LTNS FSQFMAQFDLCHKEM INKGKQHDTVKYMDTE F 
QFGITDGHE I VDLDPKLKQ INL I NFTDEMSKFELGTLRSLLPTI RKFNNCLGVYEYKA 

SEQ ID NO: 4917 

STRAIN M732 frame: 1 

KHPI LNDQKSLAIVEQIEYDLDKFDNSEAS FYATLARI RVMDREI KKFIRENPNSQI LS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSALDETWTREVNPQN 
APFL I VSEGVLMFLKEDDVETFLH I LTNS FSQFMAQFDLCHKEM I NKGKQHDTVKYMDTE 
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA 

SEQ ID NO: 4918 

STRAIN COH1 frame: 1 

KHPI LNDQKSLAI VEQIEYDLDKFDNSEASFYATLARIRVMDRE I KKF I RENPNSQI LS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSALDETWTREVNPQN 
APFLIVSEGVIiMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE 
FQFG I TDGHE I VDLD PKLKQ I NL I NFTDEMSKFELGTLRSLLPT I RKFNNCLGVYEYKA 

SEQ ID NO: 4919 

STRAIN M781 frame: 1 

KHPI LNDQKSLAI VEQI EYDLDKFDNSEAS FYATLARI RVMDRE I KKFI RENPNSQILS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSALDETWTREVNPQN 
APFL I VSEGVLMFLKEDDVETFLH I LTNS FSQFMAQFDLCHKEM INKGKQHDTVKYMDTE 
FQFG I TDGHE I VDLDPKLKQ I NLINFTDEMSKFELGTLRS LLPT I RKFNNCLGVYEYKA 

SEQ ID NO: 4920 

STRAIN CJB110 frame: 1 

KHPI LNDQKSLAI VEQI EYDFDKFDNSEAS FYATLARIRVMDREI KKFI RENPNSQI LS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTNI AKSAI DETWTREVNPQN 
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE 
FQFG ITDGHE I VDLDPKLKQ INLI NFTDEMSKFELGTLRSLLPTI RKFNNCLGVYEYKA 

SEQ ID NO: 4921 

STRAIN 1169NT frame: 1 

KHP I LNDQKSLAI VEQI EYDFDKFDNSEAS FYATLARI RVMDRE I KKFI RENPNSHI LS I 
GCGLDTRFERVDNGQ I RWYNLDLPEVME I RKLFFEEHERVTN I AKSALDETWTQEVNPQN 
APFL I VSEGVLMFLKEDDVETFLHI LTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTE 
FQFG I TDGHE I VDLD PKLKQ I NL I NFTDEMSKFELGTLRSLLPT I RKFNNCLGVYEYKA 

SEQ ID NO: 4922 

STRAIN JM9130013 frame: 2 

AI VEQI EYDFDKFDNSEAS FYATLARI RVMDRE I KKF I RENPNSHI LS I GCGLDTRFERV 
DNGQ I RWYNLDLPEVME I RKLFFEEHERVTN I AKSALDETWTREVNPQNA PFL I VSEGVL 
MFLKEDDVETFLHI LTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTEFQFGITDGHE I 
VDLDPKLKQ I NL I NFTDEMSKFELGTLRSLLPT I RKFNNCLGVYEYKA 

PRETTY of: /biotmp/msa42204 . 2 { * } January 21, 2003 05:05 

1 ' 50 

msa42204.2{l76_H36B} khpilndqks 1AIVEQIEYD fD KFDNSEAS FY AT LAR i R V MDREIKKFIR 

msa42204.2{l76 JM913 0013} AIVEQIEYD fDKFDNSEAS FYATLARiRV MDREIKKFIR 

msa42204.2{l76_090} ndqks 1AIVEQIEYD fDKFDNSEAS FYATLARiRV MDREIKKFIR 

msa42204.2{l76_18RS2l) ~ hp il ndqks 1AIVEQIEYD fDKFDNSEAS FYATLARiRV MDREIKKFIR 

msa42204.2{l76_2603} khpilndqks 1AIVEQIEYD fDKFDNSEAS FYATLARxRV MDREIKKFIR 

msa42204.2{!76 A909} khpilndqks 1 AIVEQIEYD fDKFDNSEAS FYATLARiRV MDREIKKFIR 

msa42204.2{!76 CJB110) khpilndqks 1AIVEQIEYD fDKFDNSEAS FYATLARiRV MDREIKKFIR 
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Table 49: Comparative Sequences related to SAG1502 



msa42204 . 2 { 176_COHl } 
msa42204 .2{l76_M732} 
msa42204 .2{l7 6_M78l} 
msa42204.2{l76_1169NT} 
Consensus 



khpilndqks 1AIVEQIEYD 1DKFDNSEAS FYATIxARiRV MDREIKKFIR 

khpilndqks 1AIVEQIEYD 1DKFDNSEAS FYATIiARi RV MDREIKKFIR 

khpilndqks 1AIVEQIEYD 1DKFDNSEAS FYATLARiRV MDREIKKFIR 

khpilndqks 1AIVEQIEYD f DKFDNSEAS FYATLARiRV MDREIKKFIR 



_********* 



_********* *******_** ********** 



msa42204 . 
msa42204.2{l76 
msa42204 
msa422 04 . 2{ 

msa42204 . 

msa42204 . 
msa42204.2{ 

msa42204 . 

msa42204 . 

msa42204 . 
msa42204 . 2{ 



2{l7S_H36B} 
_JM9130013} 
.2{176_090} 
176_18RS2l} 
2{176_2603} 
2{l76_A909} 
176_CJB110} 
2{176_C0H1} 
2{176_M732} 
2{176_M781} 
176_1169NT} 
Consensus 



51 

ENPNShlLSI 
ENPNShlLSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNSqILSI 
ENPNShlLSI 
*****_**** 



GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
GCGLDTRFER 
********** 



VDNGQI RWYN 
VDNGQIRWYN 
VDNGQI RWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
VDNGQIRWYN 
********** 



LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
LDLPEVMEIR 
********** 



100 

KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
KLFFEEHERV 
********** 



msa42204.2{l76_H36B] 
msa42204 . 2 {176_JM9130013 
msa42204 . 2 { 176__090 j 
msa422 04 . 2 { 176_18RS21 ] 
msa42204 . 2 { 176_2603 ] 
msa42204 .2{176_A909; 
msa42204 . 2 { 176_CJB110 1 
msa42204.2{l7 6_COHl ] 
msa42204 . 2 { 176_M732 ' 
msa42204 . 2 { 176_M781 ] 
rasa42204 . 2 { 176_1169NT} 
Consensus 



101 

TNIAKSA1DE 
TNIAKSA1DE 
TNIAKSAiDE 
TNIAKSA1DE 
TNIAKSAIDE 
TNIAKSAIDE 
TNIAKSAiDE 
TNIAKSAIDE 
TNIAKSAIDE 
TNIAKSAIDE 
TNIAKSAIDE 



TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTqEVNPQN 



APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 



LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 
LMFLKEDDVE 



150 

TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 
TFLHILTNSF 



msa42204 . 
msa42204.2{l7€ 
msa42204' 

msa42204.2{ 
msa42204 . 
msa42204 . 

msa42204.2{ 
msa42204 
msa42204 
msa42204 

msa42204.2{ 



2{176_H36B) 
;_JM9130013) 
2{176_090} 
176_18RS21} 
2{176_2603} 
2{176_A909} 
176_CJB110} 
2(l76_COHl) 
2{176_M732) 
2{176_M781} 
176_1169NT} 
Consensus 



151 

SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 



qKEM INKGKQ 
qKEM INKGKQ 
hKEM I NKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
hKEM INKGKQ 
qKEM INKGKQ 



********** _********* 



HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
HDTVKYMDTE 
********** 



FQ1GITDGHE 
FQf GITDGHE 
FQf GITDGHE 
FQf GITDGHE 
FQf GITDGHE 
FQfGITDGHE 
FQf GITDGHE 
FQfGITDGHE 
FQfGITDGHE 
FQfGITDGHE 
FQfGITDGHE 



200 
I VDLDPKLKQ 
I VDLDPKLKQ 
I VDLDPKLKQ 
I VDLDPKLKQ 
IVDLDPKLKQ 
I VDLDPKLKQ 
IVDLDPKLKQ 
IVDLDPKLKQ 
IVDLDPKLKQ 
IVDLDPKLKQ 
IVDLDPKLKQ 



**„******* ********** 



201 ' 239 

msa42204.2{l76_H36B} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_JM9130013} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_090} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

rasa42204.2{l76_18RS2l} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204. 2 {176^2603} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_A909j INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_CJB110) INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204 .2{l76_COHl} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{176_M732} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_M78l} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

msa42204.2{l76_1169NT} INLINFTDEM SKFELGTLRS LLPTIRKFNN CLGVYEYKA 

Consensus ********** ********** ********** ********* 
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SEQ ID NO. 5001 
STRAIN 2603 

ATGAAAAAAC^AAAACTATTACTGCTTATTGGAGGCTTATTAATAATGATAATGATGACA 
GC^TGTAAGGATTCAAAAATCCCAGAAAACCGCACAAAGGAAGAGTACC^GCTGAACAA 
AATTTTAAACCX3TTTTTTGAGTTTTTAGC^ 

AAATACTTACTATTAGTATCGGATTCAGGTGATGCATTAGATTTAGAATATTTCTATAGT 
ATTCAAGATTTAAAAAAAAATAAGGATTTAGGGAAGTTTGAA 

GAAAAGCCGGGTGGCTATAATGAGTTAGAAAATAAAGAGGTCCCATTTGAATATTTTAAA 

AATAATATAGTTTATCCAAAAGGAAAACCGAATATTACATTTGATGACTTTATTAT 

GCAATGGATACTAAAGAATTAAAAGAATTAAAAAAATTAA 

AAACATCCGGAAACTGAGTTGAAAGATATAACATATGAATTGCCGACACAGTCGAAGCTT 
ATTAAAAAA 



SEQ ID NO. 5002 

STRAIN 090 

TAAGGATTCAAAAATCCC^GAAAACCGCACAAAG 

GAAGAGTACCAAGCTGAACAAAATTTTAAACTGTri"l M IUGAGlTTTTAGC 
AC^AAAATATAAAGATTTGAACAAAATACAAAAATACTTACTATTAGTAT 
CGGATTCAGGTG ATG CATTAG ATTTAGAATATTT CTATAGTATT CAAGAT 
TTAAAAAAAAATAAGGATTTAGGGAAGTTTGAAACAAGAAAAAGTCAAAT 
AGAAAAGCCGGGTGGCTATAATGAGTTAGAAAATAAAGAGGTCCCATTTG 
AATATTTTAAAAATAATATAGTTTATCCAAAAGGAAAACCGAATATTACA 
TTTGATGAClU'iATTATCGGAGCAATGGATACTAAAGAATTAAAAAAATT 
AAAAGTAAAAAGTTATTTATTAAAACATCCGGAAACTGAGTTGAAAGATA 
TAACATATGAATTGC CGACACAGT CGAAG CTT ATTAAAAAA 



SEQ ID NO. 5003 

STRAIN 18RS21 

TAAGGATTCAAAAATCCCAGAAAACCGCACAAAGGAAG 

AGTACCAAGCTGAACAAAATTTTAAACCX3TTTTTTGAGT^ 

AAAGATAAAGATTTGAGCAAAATACAAAAATACTTACTATTAGTATCGGA 

TTCAGGTGATGCATTAGATTTAGAATATTTCTATAGTATTCAAGATTTAA 

AAAAAAATAAGGATTTAGGGAAGTTTGAAACAAGAAAAAGTCAAATAGAA 

AAGCCGGGTGGCTAT AA.TGAGTTAGAAAATAAAGAGGTC C CATTTGAAT A 

1"1 M ITAAAAATAATATAGTTTATCCAAAAGGAAAA.CCGAATATTAC^TTTG 

ATGACITTATTATCGGAG CAATGG ATACTAAAGAATT AAAAGAATTAAAA 

GAATTAAAAAAATTAAAAGTAAAAAGTTATTTATTAAAACATCCGGAAAC 

TGAGTTGAAAGATATAACATATGAATTGCCGGCACAGTCGAAGCTTATTA 

AAAAA 



PRETTY of: /biotmp/msa212269 . 2 { * } February 10, 2003 05:07 .. 

1 50 

msa212269.2{l84_090} : 

msa212269.2{l84_2603) atgaaaaaac aaaaactatt actgcttatt ggaggcttat taataatgat 

msa212269 .2{184_18RS21} ~ 

Consensus ********** ********** ********** ********** ********** 



msa212269 . 2 { 184_090 
msa212269 . 2 { 184_2603 
msa212269.2{l84_18RS21 
Consensus 



msa2 12269 . 2 { 184_090 } 
msa212269 . 2 { 184_2603 } 
msa212269 . 2{ 184_18RS21 } 
Consensus 



msa2 12269 . 2 { 184_090 } 
msa212269 . 2 { 184_2603 } 
msa212269 . 2 { 184_18RS21) 
Consensus 



51 

aatgatgaca 



TAAGG 

gcatgTAAGG 

TAAGG 

********** 



********** 
101 

AAGAGTACCA 
AAGAGTACCA 
AAGAGTACCA 
********** 

151 

CAAAAAtATA 
CAAAAAgATA 
CAAAAAgATA 
******_*** *********_ 



AGCTGAACAA 
AGCTGAACAA 
AGCTGAACAA 
********** 



AAGATTTGAa 
AAGATTTGAg 
AAGATTTGAg 



ATTCAAAAAT 
ATTCAAAAAT 
ATTCAAAAAT 
********** 



AATTTTAAAC 
AATTTTAAAC 
AATTTTAAAC 
********** 



CAAAATACAA 
CAAAATACAA 
CAAAATACAA 
********** 



CCCAGAAAAC 
CCCAGAAAAC 
CCCAGAAAAC 
********** 



t GTTTTTTGA 
CGTTTTTTGA 
cGTTTTTTGA 
_********* 



AAATACTTAC 
AAATACTTAC 
AAATACTTAC 
********** 



100 

CGCACAAAGG 
CGCACAAAGG 
CGCACAAAGG 
********** 

150 

GTTTTTAGCA 
GTTTTTAGCA 
GTTTTTAGCA 
********** 

200 

TATTAGTATC 
TATTAGTATC 
TATTAGTATC 
********** 



msa212269 . 2 { 184_090 } 
msa212269 . 2 { 184_2603 } 
msa212269.2{l84_18RS21) 
Consensus 



201 250 
GGATTCAGGT GATG CATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT 
GG ATT CAGGT GATG CATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT 
GGATTCAGGT GATG CATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT 
********** ********** ********** ********** ********** 



msa212269 . 2 { 184_090 } 
msa212269 . 2 { 184_2603 } 
msa212269.2{l84_18RS2l} 



Consensus 



251 • 300 

TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA 
TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA 
TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA 
********** ********** ********** ********** ********** 



msa212269 . 2 { 184_090 } 
msa212269 . 2 { 184_2603 j 
msa2 12269 . 2 { 184_18RS21 } 



Consensus 



301 350 
GAAAAGCCGG GTGG CTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA 
GAAAAGCCGG GTGG CTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA 
GAAAAGCCGG GTGGCTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA 
********** ********** ********** ********** ********** 
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Table 50: Comparative Sequences relating to SAG 1024 



msa212269 . 2 { 184_0 90 } 
msa212269 . 2 { 184J2603 } 
msa212269 . 2 { 184_18RS2l} 
Consensus 



351 

ATATTTTAAA 
ATATTTTAAA 
ATATTTTAAA 



AATAATATAG- TTTATCCAAA AGGAAAACCG 
AATAATATAG TTTATCCAAA AGGAAAACCG 
AATAATATAG TTTATCCAAA AGGAAAACCG 



********** ********** ********** ********** 



400 

AATATTACAT 
AATATTACAT 
AATATTACAT 
********** 



msa212269.2{l84_090) 
msa212269 . 2 { 184J2603 } 
msa212269.2{l84_18RS2l} 
Consensus 



msa212269 . 2 { 184_090 } 
msa2 12269 . 2 { 184_2603 } 
rasa212269.2{l84_18RS2l) 
Consensus 



msa212269. 2 {184_090} 
msa212269 . 2 { 184_2603 } 
rasa212269 . 2{ 184_18RS21 } 
Consensus 



401 

TTGATGACTT 
TTGATGACTT 
TTGATGACTT 
********** 



451 

AAAGAATTAA 
AAAGAATTAA 
AAAGAATTAA 
********** 



450 

TATTATCGGA GCAATGGATA CT 

TATTATCGGA GCAATGGATA CT aaagaatta 

TATTATCGGA GCAATGGATA CTaaagaatt aaaagaatta 
********** ********** ** — 



AAAAATTAAA AGTAAAAAGT TATTTATTAA 
AAAAATTAAA AGTAAAAAGT TATTTATTAA 
AAAAATTAAA AGTAAAAAGT TATTTATTAA 
********** ********** ********** 



501 

AACTGAGTTG 
AACTGAGTTG 
AACTGAGTTG 

********** ********** ********** ****_***** 



AAAGATATAA CATATGAATT GCCGaCACAG 
AAAGATATAA CATATGAATT GCCGaCACAG 
AAAGATATAA CATATGAATT GCCGgCACAG 



500 

AACATCCGGA 
AACATCCGGA 
AACATCCGGA 
********** 

550 

TCGAAGCTTA 
TCGAAGCTTA 
TCGAAGCTTA 
********** 



msa212269 . 2 { 184_090 } 
msa212269.2{l84_2603} 
msa212269.2{l84_18RS2l} 
Consensus 



551 

TTAAAAAA 
TTAAAAAA 
TTAAAAAA 
******** 



SEQ ID NO. 5004 

STRAIN 2603 frame: 1 

MKKQKLLLLIGGLLIMIMMTACKDSKI PENRTKEEYQAEQNFKPFFEFLAQKDKDLSKIQ 
KYLLLVSDSGDALDLEYFYS IQDLKKNKDLGKFETRKSQI EKPGGYNELENKEVPFE YFK 
NNI VYPKGKPNITFDDFI IGAMDTKELKELKKLKVKS YLLKHPETELKD ITYELPTQSKL 
IKK 

SEQ ID NO. 5005 

STRAIN 090 frame: 2 

KDSKI PENRTKEEYQAEQNFKLFFEETAQKYKDLNKIQKYLLLVSDSGDALDLEYFYS I Q 
DLKKNKDIiGKFETRKSQ IEKPGGYNELENKETVPFE YFKNNI VYPKG I TFDDF 1 1 GAM 
DTKELKKLKVKS YLLKHPETELKD I TYELPTQSKLI KK 

SEQ XD NO. 5006 

STRAIN 18RS21 frame: 2 

KDSKI PENRTKEEYQAEQNFKPFFEFLAQKDKDLSKI QKYLLLVSDSGDALDLE YFYS I Q 
DLKKNKDI/3KFETRKSQ I EKPGGYNELENKEVPFE YFKNN I VYPKGKPN I TFDDF 1 1 GAM 
DTKELKELKELKKLKVKSYLLKHPETELKDITYELPAQSKLI KK 



PRETTY of: /biotmp/msa212547 . 2 { * } February 10, 2003 05:11 



msa212547.2{l84_18RS2l} 
msa212547 . 2 {l84_2603 } 
msa2 12547.2(18 4_0 9 0 } 
Consensus 



msa212547 . 2 { 184_18RS2l} 
msa212547. 2 { 184^2603} 
msa212547.2{l84_090} 
Consensus 



msa212547.2{!84_18RS2l} 
msa212547.2{l84_2603} 
msa212547.2{l84_090} 
Consensus 



1 50 

— KDSKI PEN RTKEEYQAEQ NFKpFFEFLA 

mkkqklllli ggllimimmt acKDSKI PEN RTKEEYQAEQ NFKpFFEFLA 

—KDSKI PEN RTKEEYQAEQ NFKlFFEFLA 

********** ********** ********** ********** ***_****** 

51 100 

QKdKDLsKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI 

QKdKDLsKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI 

QKyKDLnKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI 

**_***_*** ********** ********** ********** ********** 

101 150 
EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDTkelkel 
EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDT. . .kel 

EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDT 

********** ********** ********** ********** **** 



Itisa212547 . 2{ 184_18RS2l} 
msa212547 . 2 {184_2603 j 
msa212547.2{l84_090) 
/ Consensus 



151 186 
KELKKLKVKS YLLKHPETEL KDITYELPaQ SKLIKK 
KELKKLKVKS YLLKHPETEL KDITYELPtQ SKLIKK 
KELKKLKVKS YLLKHPETEL KDITYELPtQ SKLIKK 



838 
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SEQ ID NO. 5101 
STRAIN 2603 

ttgaataataaaggtgtcggtggcgatggtgtccaaatttatcaatacta 
tatcaaaatggacaacaataaaccttacttaagtcccaaagataagacta 
ctgtagagaagttagaagatcgctggaaaaaaattactttcaaagttcag 
gatactggcattggtttgaaagacgtttatcttcaatctgttaagtatgt 
tggtggtggcaataataatttagaccttatcacacctccaggatttaaaa 
aagaagataaaaaagttgaaaaaccaaaattagaccgtccaccaggaatt 
gatttaccagcaccaacttcaatgagaagttttgattattcaaccccacc 
gggaactaagccaagcaaacccaaagatagtttatcaactcctccaggtt 
tcccagatttaaacacgccgccggatgaagcaccaaaggatagtaaaaaa 
gacgctattgaagataaatcaggagcaattaaatatgctaagtctcttca 
acttagctttgttgatggccctattttagctagcaaagtaaatggcaaaa 
tattacaagtcgaatctgatggcaaattagtcattcctagaaatgctttg 
tcagctaatcaatttgatgacactagtcttaaaatttatcgtaataataa 
tcgcaataaagaaattactatcacaacagattattttgcagatacaaaat 
atgtcaatatcacagcggttgactatttgagcaatactacttttgagcaa 
ttagctactggtgaaacagtagattaccatgccattgtattttcaagctt 
tgctgctattaaagacaagggtggtaagatttatgttaacgataaattgc 
aagaaacttctcgtatagcgcttaaagataaatctgttaagattggtatt 
gaattaccaaatgatgtcagacatattgatagtttatctgttcgtcgttt 
gaatgaggttaaaactgttgataatatcttgaaaaatgatgaacaagaca 
ttaatctcagcaaaacttaccaattaaaatacaacccgacaaatcgtcgt 
ctagagtttactattaataacattaactcaagttcagaaatcatgaccac 
tttcaaagatggaaagatgccagaattggttgaacaaaaagatgtttctt 
tggatataaacgatatggacatgagtaagtttaaaactattcgacttgga 
cgaaaggattctgaatttaagggacaacttattgcaaaaactggaacagt 
tgaattagatatgtttttcaaacaatctcaagacccagcttcaattatta 
aaaaaatataccttatccaaaatggtgttccaaatgaattgaaaaaattt 
gactctagttttggtttaactgaaagtcagatagatggatactatattta 
taaagatgcaattaaccttaaatttaaattaaccagtggtgcaagtctta 
aagttgtttataaagggcaagaagatccatatagtcatcagaaagaagat 
atgactaaaaaaggtgaacagctcagtcattcaactcaagccaatgaaaa 
tacagcaaaagtaacctttgctaatattgactggtcacattatagtaagg 
ttactgtgaatggaaaagaagttgttaaaggtagtgagttacctttaact 
aaaggatggacaacatttgtattacataaaacagaaaattcattaaatgt 
taaaagtttgattatggagacgggtagtgtaagtaagaaagttcaacaac 
ttcctttaagtcctagattatctaaaaataagcatatgagggatatgcta 
cttactatgcaaaaagattcagcgtattacgaaacaagtgacagtctagt 
ccttcgaattaatctcactgcagatactaaacttaattttaatgctgtta 
aaggagcgagtgctcttactgaaaatatgatgatgagacagtttgcagtt 
gctggaccacaagatgatcctgttagtgaacataaatacccatcagtatt 
tctcttaactcctgccttattggaaactgctagtgaggcaactctaaatg 
gtaaggaaatcacagcatctggtattatcggtcacatcaaggatggtgat 
aaaagcaagcatgttgaagtcaaaatggtgaatgaaaatggagacatgct 
aggaacccctgttattattcaaggtaaagacttgactaatcgaacaaaac 
cattaatgagtggacgtagagtactttatgccggtaaacaatatgagttc 
cgggctaaattaccacttagtcgttttaacacttggattagggfctgaagt 
ggtaacagaagcaggagagaaagcaagtattgttcgtcgcatgttctttg 
accaatcagttccagagcttaacacagcagttgctaaacgtgatttgact 
tctgatactgctcttatccacatcgttgccaaagatgactctctaaaact 
aaaattatatcaagatgattcattacttgaatctgttgataaaaccggtc 
tttatagttttagaaatggtgtagaaatcactaaagatatgacagtacca 
ctagaatttggagataatattattaagttatctgctgttgacttatcaaa 
ttatcgtcgtaatgagacccttcatatctatagaaaccgttttgatgtta 
aagcaagccaaatgacagctgacaaaggagctaaagtaactgtggatatg 
ttgatgaagcacttagttgttccagaaatggcaggagcttatacattaac 
aatcgacgaagctccaaacacaaatgaatcaggaatgttaacaaacgcta 
aagtatcgattcattatgtaaatggtggtgttgataaagttgatgttccg 
attaaagtagttgacttagaagctattcgtaaagctgaagaagcacgtaa 
agctgaagaagcacgtaaagctgaagaagcacgtaaagctgaagagggac 
ataaaacccaagaagcacctatagttgaagaaggctacaaggttaataac 
gttcatcaaactgatactacagttaaagcgtctgatttaccaaagactaa 
gacagtttccgcagttcatatggctagaacagacaataaacagataactt 
cacatcagacacatgttgaaaaacaaattaaaaatacattgccatccact 
ggtgacagcaaacgtggttattatatcactggaatggctatcgttatgct 
gagtgtattatttagtttagctaaaaagtttaaaagcaaatat 

SEQ ID NO. 5102 
STRAIN A909 

GGTGTC CAAATTTAT CAATACT ATAT CAAAATGG ACAAGAATAAACCTT A 
CTTAAGTCC C7^AAGATAAGACTTACTGTAG AGAAGTTAGAAGATCG CTGGA 
AAAAAATTACTTT CAAAGTT CAGGATACTGG CAT TGGTTTGAAAGACGTT 
TATCTT CAAT CTGTTAAGTATGTTGGTGGTGG CAATAAT AATTTAGACCT 
TATCACACCT CCAGGATTT AAAAAAGAAG ATAAAAAAGTTGAAAAAC CAA 
AATT AGACCGTC CACCAGGAATTG ATTTACCa C CACCAACTT CAATG AGA 
AGTTTTGATTATTCAACCCCACCGGGAACTAAGCCAAGCAAACCCAAAGA 
TAGTTTATCAACT C CTC CAGGTTT C CCAGATTTAAACACG C CGC CGG ATG 
AAG CACTAAAGG AT AGTAAAA^GACG CT ATTGAAGATAAAT CAGGAGCA 
ATTAAATATG CT AAGTCT CTTCMCTT AG CTTTGT TGATGAC C CTATTTT 
AGCTAGCAAAGTAAATGGCAAAATATTACAAGTCGAATCTGATGGCAAAT 
TAGT CATTCCTAGAAATG CTTTGT CAG CT AATCAATTTGATG ACACTAGT 



839 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



CTTAAAATTT AT CGT AATAATAAT CGCAAT AAAGAAATT ACT AT CACAAC 
AGATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTGACTATT 
TGAGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAACAGTAGATTAC 
CATGCCATTGTATTTTCAAGCTTTGCTGCTATTA^GACAAGGGTGGTAA 
GATTTATGTTAACG ATAAATTG CAAGAA^CTT CT CGT ATAG C G CTTAAAG 
ATAAATCTGTTAAG ATTGGT ATTG AATTAC CAAATGATGT CAGACATATT 
GATAGTTTAT CTGTTCGTC GTTTGAATGAGGTTAAAACTGTTGATAATAT 
CTTGAAAAATGATGAACAAGACATTAATCTCAGCAAAA.CTTACCAATTAA 
AAT ACAAC C CG ACAAAT CGT CGT CTAGAGTTT ACTATTAATAACATTAAC 
TCAAGTT CAG AAAT CATGAC CACTTT CAAAGATGG AAAG ATG CCAGAATT 
GGTTGAa CAAAAAG ATGTTT CTTTGGATATAa aCG ATATGGACATGAGT A 
AGTTTAAAACTATTCGACTTGGACGAAAGGATTCTGAATTTAAGGGACAA 
CTTATTGCAAAAACTGGA^C^GTTGA^TTAGATATGTTTTTCAAACAATC 
TCAAGACCCAGCTTCAATTATTAAAAAAATATACCTTATCCAAAATGGTG 
TTCOVAATGAATTGAAAAAATTTGACTCTAGTTTT 

CAGATAGATGGATACTATATTTATAAAGATGCAATTAACCTTAAATTTAA 

ATTAACCAGTGGTG CAAGT CTT AAAGTTGTTT ATAAAGGGCAAG AAG AT C 

CATATAGT CAT CAGAAAGAAGATATGACTAAAAAAGGTG AACAG CTCAGT 

CATT CAACT CAAG C CAATG AAAATACAGCAAAAGTAACCTTTGCTAATAT 

TGACTGCTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTA 

AAGGTAGTGAGTTACCTTTAACTAAAGGATGGACAACATTTGT^ 

AAAACAGAAAATTC^TTAAATGTTAAAAGTTTGATTATGGAGACGGGTAG 

TGTAAGTAAGAAAGTTCAACAACTT CCTTTAAGT C CT AGATTATCTAAAA 

ATAAG CATATGAGGGATATGCTACTTACTATG CAAAAAG ATT CAG CGTAT 

TACGAaaCAAGTGACAGTCTAGTCCTTCGAATTAATCTC 

TAAACTT AATTTTAATG CTGTT AAAGGAG CGAGTG CT CTTACTGAAAATA 

TGATGATGAGACAGTTTG CAGTTG CTGGAC CACAAGATGATCCTGTTAGT 

GAACATAAATACCCATCAGTATTTC!TCITA 

TGCTAGTGAGGCAACTCTaAATC^GTAAGGAAATCACAGa^TCTGGTATTA 
TCGGTC^CATCAAGGATGCTGATAAA^GCAAGCATGTTGAAGTCAA^TG 
GTGAATGAAAATGGAGACATGCTAGGAAC C CCTGTTATTATTCAAGGTAA 
AGACTTGACTAATCGAAC7yy\ACCATTAATGAGTGX5ACGTAGAGTACTTT 
ATGCCGGTAAACAATATGAGTT CCGGG CTAAATTAC CACTTAGT CGTTTT 
AACACITGGATTAGGGTTGAAGTGGTAACAGAAGC^GCAGAGAAAGCAAG 
TATTGTT CGT CG CATGTT CTTTGAC CAAT CAG t TCCAGAG CTTAACACAG 
CAGTTGCTAAACGTGATTTGACTTCTGATACTGCTCT^ 
GC CAAAGATGACT CT CTAAAACTAAAATTATAT CAAGATG ATTCATTACT 
TGAATCTGTTGATAAAACCGGTCTTTATAGTTTTAGAAATGGTGTAGAAA 
TCACTAAAGATATGACAGTACCACTAGAATTTGGAGATAATATTATTAAG 
TTATCTG CTGTTGACTTAT CAAATTAT CGTCGTAATGAGACCCTT CATAT 
CTATAGAAACCX5TTTTGATGTT AAAGCAA.GC CAAATGACAGCT 
GAGCTAAAGT AACTGTGGATATGTTGATGAAG CACTTAGTTGTTC CAGAA 
ATGG CAGGAGCTTATACATTAACLAATCGACGAAGAT C CAAACACAAATGA 
ATCAGGAATGTTAACAAACGCTAAAGTATCGATTCATTATGTAAATGGTG 
GTGTTGATAAAGTTGATGTT C CGATTAAAGTAGTTGACTTAG AAG CTATT 
CX3TAAAGCTGAAGAAGCACATAAAGCTGACGAAGCACGTAAAGCTGAAGA 
AG CACGTAAAGCTGAAGAAGCACGTAAAG CTGAAGAAG CACGTAAAGCTG 
AAGAGGGACATaAAACCCAAGAAG CAC CTATAGTTGAAGAAGGCTACAAG 
GTTAATAACGTTCAT CAAACTGATACTACAGTTAAAG CGTCTGATTTAC C 
AAAGACTAAGACAGTTTCCG CAGTT CATATGG CT AGAACAGACAATAAAC 
AGATAACTT CACATCAC^CA(TATGTTGAAAAA.CAAATTAAAAATA 

SEQ ID NO. 5103 
STRAIN H3 6B 

TGGTGTCCAAATTTATCAATACTATATCAAAATGGACAACAA 

ACTT AAGTCC CAAAGAT AAGACTACTGTAGAGAAGTTAG a aGAT CGCTGG 

AAAAAAATTACTTT CAAAGTT CAGGATACTGGCATTGGTTTGAAAGACGT 

TTAT CTT CAAT CTGTTAAGTAT GTTGGTGGTGG CAATAATAATTTAGAC C 

TTATCACACCTCCAGGATTTAAAAA&GAAGATA 

AAATTAGAC CGTCCACCAGGAATTG ATTTACCAG CACCAACTT CAATGAG 
AAGTTTTGATTATTCAACC C CAC CGGGAACTAAG C CAAG CAAA.C CCAAAG 
ATAGTTTATCAACTCCTCCAG^TTTCCCAGATTTAAACACGCCGCCGGAT 
GAAG CACTAAAGGATAGTAAAAAAG ACG CTATTGAAGAT AAAT CAGG AG C 
AATTAAATATG CTAAGT CTCTTCAACITAGCTTTGTTGATGAC C CTATTT 
TAG CTAGCAAAGTAAATGG CAAAATATTACAAGT CGAAT CTGATGGCAAA 
TTAGTCATT CCTAGAAATGCTTTGTCAGCTAAT CAATTTGATGACACTAG 
TCTTAAAATTTATCGTAATAAT AAT CG CAATAAAGAAATTa cTAT CACAA 
CAGATTATTTTG CAG ATACAAAATATGTCAATAT CACAG CGGTTG ACTAT 
TTGAGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAaCAGT 
CCATGCCATTGTAtTTTCAAGCTTTGCTGCT 

AGATTTATGTCAACGATAAATTGCAAGAAACIT CTCGTAT AGCG CTTAAA 
GATAAAT CTGTTAAG ATTGCTFATTGAATTAC CAAATG ATGTCAGACATAT 
TGATAGTTTATCTGTTCGTCGTTTGAATGAGCTTAAAACTGTTGATAATA 
T CTTGAAAAATGATGAACAAGACATTAAT CT CAG CA^AACTTACCAATTA 
AAATACAACCCGACAAATCGTCGTCTAGAGTTTACTATTAATAACATTAA 
CTCAAGTTCAGAAAT CATGACCACTTT CAAAG ATGGAAAGATG C CAgAAT 
TGGTTGAACAAAAAGATGTTTCTTTGGATATAAACGATATGGACATGAGT 
AAGTTTAAAACrATTCGACTTGGACGAAAGGATTCTGA^ 
ACTTATTGCAAAAACTXX-IAACAGTTG^ 

CT CAAGACC CAG CTT CAATTATTA*U\AAAATATAC CTTAT CCAAAATGGT 

GTTCCAAATGAATTGAAAAAATTTGACTCTAGTTTTGGTTTA 

T CAG ATACATGGAT ACTAT ATTTATAAAG ATG CAATTAAC CTTAAATTTA 

AATTAAC CAGTGGTG CAAGTCTTAAAGTTGTTTATAAAG GGCAAG AAGAT 



840 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



CCATATAG t CAT CAGAAAGAAG AT ATG ACTAAAAAAG GTG AACAG CT CAG 
TCATTCAACT CAAG C CAATGAAAATACAG CAAAAGTAAC CTTTG CTAATA 
TTGACIGGTCACATTATAGTAA.GGTTACTGTGAATGGAAAAGAAGTTGGT 
AAAGGTAGTGAGTTACCITTAACTAAAGGATGGACAACATTTGTATTACA 
TAAAACAG AAAATT CATTAAATGTTAAAAGTTTGATTATGGAGACGG GTA 
GTGTAAGTAAGAAAGTT CAACAACTTC CTTTAAGT CCTAG ATTAT CT AAA 
AATAAGCAT ATG AGGGATATG CTACITACTATGCAAAAAGATTCAG CGTA 
TT ACGAAACAAGTGACAGT CTAGTCCT TCG AATT AAT CT CACTG CAGAT A 
CTAAACTTAATTTT AATGCTGTTAAAGGAGCGAGTG CT CTT ACTG AAAAT 
ATGATGATG AGACAGTTTG CAGTTG CTGG ACCACAAG ATGAT C CTGTTAG 
TGAACAT AAATAC C CATCAGTATTT CTCTT AACT CCTG C CTTATT GG AAA 
CTGCTAGTGAGGCaACrCTAAATGGTAAGGAAATCACAGCATCTGGTATT 
AT CGGTCACATCAAGGATGG t G ATAAAAG CAAGCATGTTG AAGT CAAAAT 
GGTGAATGAAAATGGAGACATGCTAGGAACCCCTGTTATTATTCAAGGTA 
AAGACTTGACTAAT CGAACAAAAC CATTAATG AGTGGACGT AGAGTACTT 
TATGCCGGTAAACAATATGAGTTCCGGGCTAAATTACCACTTAGTCGTTT 
TAAC a CTTGGATTAGGGTTGAAGTGGT AACAG AAG CAGG AGAGAAAG CAA 
GTATTGTTCGTCGCATGTTCTTTGACCAATCAGT^ 
GCAGTTGCTAAACGTGATTTGACITCTGATACTGCTCTTAT 
TGCCAAAGATGACTCTCTAAAACTAAAATTATAT CAAGATGATT CATTAC 
TTGAATCTGTTGATAAAACCGGTCTTTATAGTTTTAGAAATGGTGTAGAA 
ATCACTAAAGATATGACAGTACCACTAGAATTTGGAGATAATATTACTAA 
GTTAT CTGCTGTTGACITATCAAATTATCGT CGTAATGAGAC CCTTCATA 
T CTATAGAAACCGTTTTGATGTTAAAG CAAGCCAAATGACAG CTGACAAA 
GGAG CTAAAGTAACTGTGGATATGTTGATGAAGCACTTAGTTGTT CCAGA 
AATGG C^GGAGCTTATACATTAACAAT CGACGAAG CTC CAA 
AAT CAGGAATGTTAACAAACGCTAAAGTAT CGATTCATTATGTAAATGGT 
GGTGTTGATAAAG t tGATGTTCCGATTAAAGTAGTTGACTTAGAAGCTAT 
T CGTAA^GCTGAAGAAG CACATAAAGCTG ACG AAG CACGTAAAG CTGAAG 
AAGCACGTAAAGCTGACGAAGCACATAAAG^GAAGAAGTACGTAAAGCT 
GAAGAAG CACATAAAGT CGAAGAAG CACGTAAAGCTG AAGAGGGACATAA 
AACCCAAGAAGCACCTATAGTTGAAGAAGG CTACAAGGTT AATAACGTT C 
ATCAAACTGATACTACAGTTAAAGCGTCTGATTTACCAA^GACTAAGACA 
GTTT C CG CAGTTCATATGG CTAGAACAGACAATAAACAG ATAACTT CACA 
T CAGACACATG 

SEQ ID NO. 5104 
STRAIN 18RS21 

TTGAATAATAAAGGTGTCGGTGGCGATGGTGTCCAA 

ATTT ATCAATACTATAT CAAAATGGACAACAATAAAC CTT ACTTAAGTCC 
CAAAGATAAGACTACTGTAGAGAAGTTAGAAGATCGCTGGAAAAAAATTA 
CTTTCAAAGTTCAGGATACTGGCATTGGTTTGA^ 

TCTGTTAAGTATGTTGK3TGGTGGGAATAATAATTTAG AC CTT AT CACAC C 
T C CAGGATTT AAAAAAGAAGAT AAAAAAGTTGAAAAA.C CAAAATT AG ACC 
GT CCACCAGGAATTGATTT ACCAG CAC CAACTTCAATGAG AAGTTTT GAT 
TATTCAACC C CAC CGGGAACTAAG CCAAG CAAAC C CAAAGAT AGTTTAT C 
AACTCCTCCAGGTTTCCCAGATTTAAACACGCCGCCGGaTGAAGCACCAA 
AGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAATTAAATAT 
GCTAAGT CTCITCAACTTAGCTTTGTTGA CTATTTTAG CT AG CAA 

AGTAAATGG CAAAATATTACAAGT CGAAT CTGATGGCAAATTAGTCATTC 
CT AGAAATG CTTTGT CAGCTAATCAATTTGATGACACTAGTCTTAAAATT 
TATCGTAATAATAATCGCAATAAAGAAATTACTATCACAACAGATTATTT 
TG CAGATACAAAATATGTCAATAT CACAG CGGTTG ACTATTTGAG CAATA 
CTACTTTTGAGCAArrAGCTACTOGTGAAACAG 

GTATTTTCAAGCTTTGCTGCTATTAAAGACAAGGGTGGTAAGATTTATGT 
TAACGAT AAATTG CAAGAaACTTCTCGTATAGCG CTTAAAGATAAAT CTG 
TTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGATAGTT^ 
TCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCTTGAAAAA 
TGATGAACAAGACATTAATCTCAG CAAaACTTACCAATTAAAATACAAC C 
CGACAAATCGT CGT CTAGAGTTTACTATTAAT AACATTAACT CAAGTT CA 
GAAATCATGACCACTTTCAAAGATGGAAAGATGCCAGAATTGGTTGAACA 
AAAAGATGTTTCTTTGGATATaAACGATATGGACATGAGTAAGTTTAAAA 
CTATT CG ACI^GGACGAAAGGATT CTGAATTTAAGGGACAACTT 
AAAACTGGAACAGTTGAATTAGATATGTTTTT CAAACAAT CT CAAG ACCC 
AG CTT CAATTATTAAAAAAATATAC CTTATC CAAAATGGT GTTC CAAATG 
AATTGAAAAAATTTGACTCTAGTTTTGGTTTAACT 

GGATACT ATATTTAT AAAGATG CAATTAAC CTTAAATTTAAATTAAC CAG 
TGGTGCAAGTCTTAAAGTTGTTTATAAAGGGCAAGAAGATCCATATAGTC 
AT CAGAAAGAAGATATGACTAAAAAAGGTGAACAG CT CAGTCATT CAACT 
CAAGC CAATGAAAATACAG CAAAAGTAAC CTTTG CTAATATTGACTGGTC 
ACATTATAGTAAGGTTACTGTG AATGG AAAAGAAGTTGTTAAAGGTAGTG 
AGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACAT^ 
AATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTGTAAGTAA 
GAAAGTT CAACAACTTCCTTTAAGT CCTAGATTAT CTAAAAATAAGCAT A 
TGAGGGATATGCTACTTACTATGCAAAAAGATTCAGCGTATTACGAAACA 
AGTGACAGTCTAGT CCTTCGAATTAAT CT CACTG CAGATACTAAACTTAA 
TTTTAATGCTGTTAAAGGAGCGAGTGCTCTTACTGAAAATATGATGATGA 
GACAGTTTG CAGTTG CTGG ACCACAAG ATG ATCCTGTTAGTGAACATAAA 
TACCCAT CAGTATTTCT CTTAACT C CTG C CTTATT GGAAACTG CTAGTGA 
GG CAACTCTAAATGGTAAGG AAAT CACAG CAT CTGGTATTATCGGTCACA 
TCAAGGATGGTGAT AAAAG CAAGCATGTTGAAGT CAAAATGGTG A^T GAA 
AATGGAG ACATG CTAGGAACCC CTGTTATT ATTCAAGGT AAAGACTTGAC 
TAAT CGAACAAAACCATTAATGAGTGGACGTAGAGTACTTTATGC CGGTA 



841 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



AACAATATGAGTT C C GGG CT AAATT AC CACTT AGT CGTTTTAACACTTGG 
ATTAGGGTTGAAGTGGTAACAGAAGCAGGAGAGAAAGCAAGTATTGTTCG 
TCGCATGTTCTTTGACCAATCAGTTCCAGAGCTTAACACAGCAGTTGCTA 
AACGTGATTTGACTTOTGATACTGCTCTTATCCACATCGTTGCCAAAGAT 
GACT CT CTAAAACT AAAATTATAT CAAGATGATT CATTAC t T GAAT CTGT 
TGATAAAACCGGTCITTATAGTTTTAGAAATGGTGTAGAAATCACTAAAG 
AT ATGACAGTAC CACTAGAATTTGGAG ATAATATTATTAAGTTAT CTG CT 
GTTGACTTATCAAATTATCGTCGTAATGAGACCCTTCATATCTATAGAAA 
CCGTTTTGATGTTAAAGCAAGCCAAATGACAG CTGACAAAGG AGCTAAAG 
TAACTGTGGaTATGTTGATGAAGCACTTAGTTGTTCC^GAAATGGCAGGA 
G CTT ATACATTAACAATCGACG AAG CT CCAAACACAAATG AATCAGGAAT 
GTTAACAAACG CTAAAGTAT CG ATT CATTATGT AAATGGTGGTGT TG ATA 
AAGTTGATGTTCCGATTAAAGTAGTTGACTTAGAAGCTATTCGTAAAGCT 
GAAGAAG CACGT AAAGCTGAAGAAG CACGT AAAG CTG AAG AGGGACATAA 
AACC CAAGAAG CACCTAT AGTTGAAGAAGG CTAGAAGGTT AATAACGTT C 
AT CAAACTGATACTACAGTTAAAG CGT CTGATTTAC CAAAGACTAAG ACA 
GTTTCCGCAGTTCATATGGCTAGAACAGACAATAAACAGATAACTTCACA 
TCAGACACATGTTGAA 

SEQ ID NO. 5105 
STRAIN M732 

AAATTTATCAATACTATAT CAAAATGGACAACAATAAAC CTTACTTAAGT 
CCCAAAGATAAGACTACTGTAGAGAAGTTAGAAGATCGCTGGAAAAAAAT 
TACTTTCAAAGTT CAGGATACTGG CATTGGTTTGAAAGACGTTTATCTTC 
AATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAGACCTTATCACA 
C CTCCAGGAITTAAAAAAGAAGATAAAAAAGTTG AAAAAC CAAAATTAGA 
CCGT CCa C CAGG AATTGATTTACCAG CAC CAACTT CAATGAGAAGTTTTG 
ATTATTCAACCCCAC CGGGAACTAAGC CAAGCAAACCCAAAGATAGTTTA 
TCAACTCCTCCAGGTTTCCCAGATTTAAACACGCCGCCGGATGAAGCCAC 
CAAAGGATAGTAAAAAAGACGCTATTX3AAGATAAATCAGGAGCAATTAAA 
TATGCTAAGTCT CTT CAACTTAG CTTTGTTGATGACC CTATTTTAG CTAG 
CAAAGTAAATGGCAAA^TATTACAAGT CX3 AA.T CTGATGG CAAATT 
TTCCTAGAAATGCTTTGTCAGCTAATCAATTTGATGACACT^ 
ATTTATCGTAATAATA^TCGCAATAAAGAAATTACTATCACAACAa^TTA 
TTTTG CAGAT ACAAAATATGTCAATAT CACAG CGGTTGACTATTTGAG CA 
ATACT ACTTTTG AG CAATT AGCTA CTGGTG AAACAGTAGATTAC CATG C C 
ATT GTATTTT CAAG CTTTG CTG CTATT AAAGACAAGG GTG GTAAGATTT A 
TGTTAACGAT AAATTGCAAG AAACTTCT CGTATAG CG CTTAAAGATAAAT 
CTGTTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGATAGT 
TTAT CTGTT CGT CGTTT GAATGAGGTT AAAACTGTTGATAATAT CTTGAA 
AAATGATGAACAAGACATTAATCT CAGC^VAAACTTAC CAATT AAA&T ACA 
AC CCG ACAAATCGTCGT CTAGAGTTTACT ATT AATAAGATTAACT CAAGT 
TCAGAAATCATGACCACTTT CAAAGATGGAAAGATG CCAG AATTGGTTGA 
ACAAAAAGATGTITCTTTGGATATAAACGAT^ 
AAACT ATTCGACTTGGA.CGAAAGG ATT CTGAATTT AAGGGA 
GCAAAAACTGGAACAGTTGAATTAGAT ATGTTTTT CAAACAAT CT CAAGA 
CCCAG CTT CAATTATTAAAAAAATATACCTTATC CAAAATGG t GT TCCAA 
ATGAATTGAAAAAATTTGACTCTAGTTTTGGTTTAACT 
GATGGATACT ATATTTATAAAGATG CAATTAACCTTAAaTTTAAATTAAC 
CAGTGGTGCAAGTCITAAAGTTGTTTATAAAGGGCAAGAAGATCCATATA 
GT CAT CAGAAAG AAGATATG ACTAAAA&AGGTGAACAGCT CAGTCATTCA 
ACTCAAG CCAATGAAAATACAGCAAAAGTAACCTTTG CTAATATTGACTG 
GTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTAAAGGTA 
GTGAGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACATAAAACA 
GAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTGTAAG 
TAAGAAAGTT CAACAACTT C CTTTAAGTCCTAGATTATCTAAAAATAAGC 
ATATGAGGGATATGCTACTTACTATGCAAAAAGATTCAGCGTATTACGAA 
ACAAGTG ACAGT CTAGT CCTTCGAATTAAT CTCACTG CAGATACTAAACT 
TAATTTT AATG CTGTTAAAGGAGCG AGTG CT CTTACTGAAAATATGATG A 
TGAGACAGTTTGCAGTTG CTGGAC CACi^AGATGAT CCTGTT aGTG AACAT 
AAATACCCATCA-GTaTTTCTCTTAACTCCTGCCTTATTGGAAaCTGCTAG 
TGAGG CAACT CTAAATGGTAAGGAAAT CACAG CAT CTGGTATTAT CGGTC 
ACAT CAAGG ATGGTG ATAAAAGCAAGCATGTTGAAGT CAAAATGGTGAAT 
GAAAATGGAGAa^TGCTAGGAACCCCTGTTATTATTCAAGGTAAAGACrT 
GACTAATCGAACAAAAC CATTAATG AGTGGACGTAGAGTACTTTATG C CG 
GT AAACAATATGAGTTCCGGGCTAAATTAC CACTT AG t CGTTTTAACACT 
TGGATTAGGGTTGAAGTGGTAACAGAAGCAGGAGAGAAAGCAAGTATTGT 
T CGT CGCATGTT CTTTGAC CAAT CAGTTCCAGAG CTT AACACAG CAGTTG 
CTAAACGTGATTTGACTT CTGATACTG CTCTT ATCCACAT CGTTG CCAAA 
GATGACTCTCTAAAACT AAAATTAT AT CAAGATGATT CATTACTTGAAT C 
TGTTGATAAAACCGGTCTOTATAGTTTTAGAAATGGTGTAGAAATCACTA 
AAGATATGACAGTACCACTAGAATTTGGAGATAATATTATTAAGTTATCT 
GCTGTTGACTTAT CAAATTATCGTCGT AATGAGACCCTT CATAT CTATAG 
AAACCGTTTTGATGTTAAAGCAAGCCAAATGACAGCTGACAAAG 
AAGTAACTGTGGATATGTTGATGAAGCACTTAGTTGTTCCAGAAATGGCA 
GGAG CTT ATACATTAACAATCGACGAAG CTCCAAACACAAAT GAATCAGG 
AATGTTAACAAACGCTAAAGTATCGATTCATTATGTAAATGGTGGTGTTG 
ATAAAGTTGATGTT C CX3ATTAAAGT AGTTG ACTTAGAAG CTATTCGTAAA 
G CTGAAG AAG CACATAAAG CTGACGAAG CACGTAAAGCTG AAGAAGCACG 
TAAAG CTGAAGAAG CACATAAAGCTGAAGAAGTACGTAAAGCIGAAGAAG 
CACATAAAGT CGAAGAAGCACGTAAAG CTGAAGAGGGACATAAAACCCAA 
GAAG CACCT ATAGTTGAAGAAGGCT ACAAAGTTAATAACGTT CAT CAAAC 
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TG AT ACT ACAGTTAA^G CGT CTGATTTAC CAAAGACT AAGACAGTTT C C G 
CAGTT CATATGG CTAGAACAG ACAATAAACAG ATAACTT CACAT CAG ACA 
CATGTTGAAAA 

SEQ ID NO. 5106 
STRAIN COH1 

GTCCAAATTTATCAATACTATATCAAAATGGACAACAATAAACCTTACTT 

AAGT CCCAAAGATAAGACTACTGTAGAG AAGTTAGAAGAT CG CTGGAAAA 

AAATTACTTTCAAAGCTGAGGATACTGGCATTGGTTTGAAAGACGTTTAT 

CTTCAATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAGACCTTAT 

CA.CACCTCCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAAAACCAAAAT 

TAGACCGTCCACCAGGAATTGATTTACCAGCACCA^CTTCAATGAGAAGT 

TTTGATTATTCAACCCC!ACCGGGAACTAAGCCAAGCA7^CCCAAAGATAG 

TTTAT CAACT CCT CCAGG t TT CCCAGATTTAAA.CACG CCG C CGGATG AAG 

CCaCCAAAGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAAT 

TAAATATGCTAAGTCTCTTCAACTTAGCTTTGTTGATGACCCTATTTTAG 

CTAGCAAAGTAAATGGCAA^TATTACAAGTCGAATCTIGATGGCAAATTA 

GTCATTCCTAGAAATGCTTTGTCAGCTAATCAATTTGATGACACTAGTCT 

TAAAATTTAT CGTAATAATAATCG CAATAAAG AAATTACT AT CACAACAG 

ATTATTTTG CAGATACAPAATATGT CAATAT CACAG CGGTTGACTATTTG 

AGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAACAGTAGATTACCA 

TG CCATTGTATTTTCAAGCTTTG CTG CTATTAAAGACAAGGGTGGTAAGA 

TTTATGTTAACGATAAATTGCAAGAAACTTCT CGTATAG CG CTTAAAGAT 

A^TCTGTTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGA 

TAGTTTATCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCT 

TGAAAAATGATGAAC^GACATTAATCTCn.GCAAAACTTACCAATTAW^ 

TACAACC CGACAAAT CGTCGTCTAGAGTTTACTATTAATAACATTAACT C 

AAGTTCAGAAAT GATGACCACTTT CAAAGATGGAAAGATG C CAGAATTGG 

TTGAACAAA^GATGTTTCITTGGATATAAACa^TATGGACATGAGTAAG 

TTTAAAACTATTCGACTTGGACGAAA.Ga^TTCTGAATTTAAGGGACAACT 

TATTG CAAAAACTGGAACAGTTGAATTAGATATGTTTTTCAAACAATCTC 

AAGAC CCAG CTT CAATT ATT AAAAAAATATAC CTTATCCAAAATGGTGTT 

CCAAATGAATTGAAAAAATTTGACTCTAGTTTTGGTTTA^ 

GATAGATGGATACTATATTTATAAAGATGCAATTAACCTTAAATTTAAAT 

TAACCAGTGGTG CAAGT CTTAAAGTTGTTTATAAAGGG CAAG AAGAT CCA 

TATAGT CATCAG AAAGAAGATATGACTAAAAAAGGTGAACAG CT CAGT CA 

TTCAACT CAAG C CAATG AAAAT ACAGCAAAAGTAACCTTTGCTA^TATTG 

ACTGGTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTAAA 

GGTAGTGAGTTACCTCTAACTAAAGGATGGACAACATTTGTATTACATAA 

AACAGA^AATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTG 

TAAGTAAGAAAGTTCAACAACTT CCTTTA^GT CCTAgATTAT CTAAAAAT 

AAG CATATGAGGGATATGCTACTTACTATG CAAAAAG ATT CAG CGTATTA 

CG AAACAAGTGACAGT CTAGTC CTT CGAATTAATCTCACTG CAGATACTA 

AACTTAATTTTAATGCTGTTAAAGGA 

ATGATGAGACAGTTTGCAGTTGCTGGACCACAAGATGATCCTGTTAGTGA 

ACATAAATACCCATCAGTATTT CTCTTAACT C CTG CCTTATTGGAAACTG 

CT AGTGAGG CAACT CTAAATGGTAAGGAAATCACAGCATCTGGTATTAT C 

GGTCAC^TCAAGGATGGTGATAAAAGCAAGCATGCTGAAGTCAAAATGGT 

GAATG AAAATGG AGACATG CTAGGAAC CCCTGTTATTATT CAAGGTAAAG 

ACTTGACTAATCGAACAAAACCACTA^TGAGTGGACGTAGAGTACTTTAT 

GCCGGTAAACAATATGAGTTCCGGGCTAA^TTACCACTTAGTCGTTTTAA 

CACTTGGATT AGGGTTGAAGTGGT AACAGAAG CAG GAGAGAAAG CAAGT A 

TTGTTCGTCGCATGTTCTT'TGACCAATCAGTTCCAa^GCTTAACACAGCA 

GTTGCTAAACGTGACTtGACCTCTGATACTGCTCTT^ 

CAAAGATGACTCT CTAAAa CTAAAATTATAT CAAGATGATTCATTACTTG 

AATCTGTTGATAAAACCGGTCTCTATAGTTTTAGAAATGGTGTAGAAATC 

ACTAAAGATATGACAGTACCACTAGAATTTGGAGATAATATTATTAAGTT 

AT CTG CTGTTG ACTTATCAAATTAT CGT CGTAATGAGACC CTT CATATCT 

AT AG AAACCGTTTTGATGTTAAAG CAAG CCAAATGACAG CTG ACAAAGG A 

GCTAAAGTAACTGTGGATATGTTGATGAAGCACTTAGTTGTTCCAGAAAT 

GGCAGGAGCTTATACATTAA(2AATCGACGAAGCTCCAAACACAAATGAAT 

CAGGAATGTTAACAAACG CTAAAGTAT CGATT CATTATGTAAATGGTGGT 

GTTGATAAAGTTGATGTTCCGATTAAAGTAGTTGACTTAGAAGCTATTCG 

TAAAG CTGAAGAAG CACATAAAG CTGACG AAG CACGTA*^AGCTGAAG AAG 

CACX3TAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAAGCTGAA 

GAAGCAC1ATAAAGTCGAAGAAGCACGTAAAGCTGAAGAGGGACATAAAAC 

C CAAGAAGCAC CTATAGTTGAAGAAGG CTACAAAGTT AAT AACGTTCATC 

AAACTGATACTACAGCTAAAGCGTCTGATTTACCAAAGACTAAGACAGTT 

T C CG CAGTT CATATGG CTAGAACAGACAATAAACAGATAACTTCACATCA 

GACACATGT 

SEQ ID NO. 5107 
STRAIN M781 

TTGAATAATAAAGGTGTCGGTGGCGATGGT 

CTCCAAATTTATCAATACTATATCAA^TGGACAACAATAAACCTTACTT 
AAGTC CCAAAGATAAGACTACTGTAGAGAAGTTAG AAGAT CG CTGGAAAA 
AAATT ACTTT CAAAGTT CAGGATACTGGCATTGGTTTGAAAGACGTTTAT 
CTTCAAT CTGCTAAGTATGTTGGTGGTGG CAATAA.TAATTTAGACCTTAT 
CACAC CT CCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAAAACCAAAAT 
TAGACCGTCCACCAGGAATTGATTaACCAGCACCAACTT: , CAATGAGAAGT 
TTTGATT ATT CAAC CCCAC CGGGAACTAAG CCAAGCAAAC CCAAAGATAG 
TTTAT CAACT CCT C CAGGTTT C CCAGATTT AAACACG CCG C CGGATGAAG 
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CCaCCyVAAGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAAT 
TAAATATGCTAAGTCTCTT CAACT T AG CTTTGTTG ATGAC CCTATTTTAG 
CTAG CAAAGTAAATGG CAAAATATTACAAGTCGAAT CTG ATGGCAAA.TT A 
GT CATTC CT AGAAAT GCTTT GT CAG CT AAT CAAT TTGATG ACACT AGT CT 
TAAaATTTATCGTAATAATAATCGCAATAAAGAAATTaCTATCACAACAG 
ATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTGACTATTTG 
AGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAACAGTAGATTACCA 
TGCCATTGTATTTT CAAGCTTTG CTG CTAT TAAAGACAAGGGTGGTAAGA 
TTTATGTTAACGATAAATTGCAAGAAACrrTCrrCGTATAGCGCTTAAAGAT 
AAATCTGTTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGA 
TAGTTTATCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCT 
TGAAAAATGATG AACAAGACATTAATC TCAG CAAAACTTACCAATTAAAA 
TACAACC CG ACAAATCGT CGTCTAG AGTTTACTATTAAT AACATTAACT C 
AAGTT CAGAAAT CATGACCACT TT CAAAG ATGGAAAGATG C CAGAATTG G 
TTGAACAAAAAGATGTTTCTTTGGATATAAACGATATGGACATGAGTAAG 
TTTAAAACTATTCGACTTGGACGAAAGGATrCTGAATTTAAGGGACAACT 
TATTGCAAAAACTGGAACAGTTGAATTAGATATGTTTTT CAAACAAT CT C 
AAGACCCAGCTTCAATTATTAAAAAAATATACCTTATCCAAAATGGTGTT 
CCRAATGAATTGAAAAAATTTGACTCTAGTTTTGGTTTAACT 
GATAGATGGATACTATATTTATAAAGATG CAATTAACCITAAATTTAAAT 
TAAC GAGTGGTG CAAGTCTT AAAGTTGTTTAT AAAGGGCAAG AAGAT C CA 
TATAGT CAT CAGAAAGAAGATATG ACTAAAAAAGGTGAACAG CT CAGT CA 
TTCAACTCAAGCCAATGAAAATACAGCAAAAGTAACCT^ 
ACTGGTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTAAA 
GGTAGTGAGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACATAA 
AACAGAAAATT CATTAAATGTT AAAAGTTTGATTATGGAGACGGGTAGTG 
TAAGTAAGAAAGTTCAACAACTTCCTTTAAGT C CTAGATT AT CTAAAAAT 
AAGCATATGAGGGATATGCTACTTACTATG CAAAAAGATT CAGCGTATT A 
CGAAACAAGTGACAGTCTAGTC CTT CG AATTAAT CTCACTGCAGATACT A 
AACTTAATTTTAATGCTGTTAAAGGAG CGAGTGCTCTTACTGAAAAT ATG 
ATGATGAGACAGTTTGCAGTTGCTGGACCACAAGATGATCCTGTTAGTGA 
ACATAAATACCCAT CAGTATTT CT CTTAACTCCTG CCTT ATTGGAAACTG 
CTAGTGAGG CAAOTCTAAATGGTAAGGAAATCACAGCAT CTGGTATTAT C 
GGTCACATCAAGGATGGTGATAAAAGCAAGCATGTTGAAGTCAAAATGGT 
GAATGAAAATGGAGACATG CTAGGAACCCCTGTTATT ATT CAAGGTAAAG 
ACTTGACTAATCGAACAAAACCATTAATGAGTGGATO 
GCCGGTAAACAATATGAGTT CCGGG CTAAATTAC CACTTAGT CGTTTTAA 
CACTT GGATTAGGGTTGAAGTGGTAACAGAAG CAGGAGAG AAAG CAAGTA 

GTTGCTAAACGTGATTTGACITCTGATACTGCT 

CAAAGATGACT CT CT AAAACTAAAATTAT ATGkAGATGATTCATTACTTG 
AATCTGTTGATAAAACCGGTCTTTATAGTTTTAGAAATGGTGTAG 
ACTAAAGATATGACAGT AC CACTAGAATT TGGAGATAATATTATTAAGTT 
AT CTG CTGTTGACTTATCAAATTAT CGT CGTAATGAGAC C CTT CATAT CT 
ATAGAAACCGTTTTGATGTTAAAGCAAGCCAAATGACAGCTGACAAAGGA 
GCTAAAGTAACTGTGGATATGTTG ATGAAG CACTTAGTTGTT CCAGAAAT 
GG CAGGAGCTTATACATTAACAAT CGACGAAGCTCCAAACACAAATGAAT 
CAGGAATGTTAACAAA.CGCTAAAGTAT CGATTCATTATGTAAATGGTGGT 
GTTGATAAAGTTGATGTTC CGATTAAAGTAGTTGACTTAGAAGCTATTCG 
TAAAG CTGAAGAAG CACATAAAG CTGACGAAG CACGTAAAGCTG AAGAAG 
CACGTAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAAGCTGAA 
GAAG CACATAAAGT CGAAGAAG CA C CGTAAAG CTG AAGAGGG ACATAAAA 
CC CAAGAAG CACCTATAGTTGAAGAAGGCTACAAAGTTAATAACGTT CAT 
CAAACTGATACTACAGTTAAAG CGTCTGATTTACCAAAGACTAAGACAGT 
TTCCGCAGTTCATATGGCTAGAACAGACAATAAACA 
AGACACATGTTG 

SEQ ID NO. 5109 
STRAIN JM9130013 

TGGTGTCCAAATTT ATCAATACTATAT CAAAATGG ACAACAATAAAC 
CIT'ACrrTAAGTCCCAAAGATAAGACTACTGTAGAGAAGTT AGAAGAT CG C 
TGGAAAAAAATTACTTT CAAAGTT CAGGATACTGGCATTGGTTTGAAAGA 
CGTTTATCTTCAATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAG 
ACCTTATCACACCTCCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAAAA 
CCAAAATTAGACCGTCCACCAGGAATTGATTTACCAGCACCA^ 
GAGAAGTTTTGATTATT CAAC CCCAC CGGGAACTAAG CCAAG CAAACCCA 
AAGATAGTTTATCAACT CCT CCAGGTTTCC CAGATTTAAA 
GATGAAGCACCAAAGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGG 
AGCAATTAAATATG CTAAGT CTCTT CAACTTAGCTTTGTTG^ 
TTTTAGCTAGCAAAGTAAATGGCAAAATATTACAAGTCX3AATCTGATGGC 
AAATTAGTCATTCCTAGAAATG CTTTGTCAGCTAATCAATTTGATGACAC 
TAGTCITAAAATTTATCGT AATAAT AATCG CAATAAAGAAATTACTATCA 
CAACAGATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTGAC 
TATTTGAGCAaTACTACTTTTGAGCAATTA^ 

TTAC CATGC CATTGTATTTT CAAG CTTTG CTG CTATTAAAGACAAGGGTG 
GTAAGATTTATGTTAACGATAAATTGCAAGAAAC^ CG CTT 

AAAGATAAATCTGTTAAGATTGGTATTGAATTAC CAAATGATGT CAGACA 
TATTGATAGTTTAT CTGTTCGTCGTTTGAATG AGGTT AAA^ 
ATAT CTTGAAAAATGATGAACAAG ACATTAATCT CAG CAAAACTTAC CAA 
TTAAAATACAACCCGACAAATCGT CGT CTAGAGTTTACTATTAATA^CAT 
TAACT CAAGTT CAGAAAT CATGACCACTTT CAAAG ATGGAAAGATG CCAG 
AATTGGTTGAACAAAAAGATGTTTCTTTGGATATAAACGATATGGAC^ 
AGTAAGTTTAAAACTATTCGACITGGACGAAAGGATTCT 
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ACAACTTATTG CAAAAACT G GAACAGTTGAATTAG ATAT GTTTTT CAAAC 
AAT CT G^GAC CCAG CH?TCAATTATTAAAAAAAT ATACCTTAT CCAAAAT 
GGTGTT C CAAATG AATTGAAAAAATTTG ACTCTAGTTTTGGTTTAACTG A 
AAGTCAGATAGATGGATACTATATTTATAAAGATGCAATTAACCTTAAAT 
TTAAATTAACCAGTGGTGCAaGTCTTAAAGTTGTTTATAAAGGGCAAGAA 
GATC CAT ATAGT CAT CAG AAAGAAG AT ATG ACT AAAAr AGGTGAACAG CT 
CAGT CATTCAACT CAAG CCAATGAAAATACAG CAAAAGTAACCTTTG CT A 
ATATTGACT GGT CACATTAT AGTAAGGTT ACTGTG AATGGAAAAGAAGTT 
GGTAAAGGTAGTGAGTTACCTTTAACTAAAGGATGGACAACATTTGTATT 
ACATAAAACAGAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGG 
GT AGTGTAAGTAAGAAAGTTCAACAACTT C CTTTAAGTCCTAGATTAT CT 
AAAAATAAG CATATG AGGG ATATG CTACTTACTATG CAAAAAGATTCAG C 
GTATTACGAAACAAGTGACAGTCTAGTCCTTCGAATTAATCTCACTGCAG 
ATACTAAACTTA^TTTTAATGCTGTTAAAGGAGCGAGTGCTCTTACTGAA 
AATATGATG ATGAGACAGTTTG CAGTTGCTGG AC CACAAGATGAT C CTGT 
TAGTGAACATAAATAC C CAT CAGTATTTCT CTTAACT CCTG C CTTATTGG 
AAACTGCTAGTGAGG CAACTCTAAATGGTAAGGAAAT CACAG CAT CTGGT 
ATTAT CGGTCACATCAAGGATGGT GATAAAAG CAAG CATGTTGAAGT CAA 
AATGGTG AATGAAAATGGAG ACATG CTAGG AAC C C CTGTTATTATTCAAG 
GTAAAGACTTGACTAAT CG AACAAAAC CATTAATGAGTGGACGTAGAGTA 
CITTATGCCGGTAAA.CAATATGAGTTC CGGG CTAAATTACCACTTAGTCG 
TTTTAACACTTGGATTAGGGTT GAAGTGGT AACAG AAGCAGGAg aGa a ag 
cAaGTATTGTTCGTCG CATGTT CTTTGAC CAATCAGTTC CAG AG CTTAAC 
ACAG CAGTTG CTAAACGTGATTTGACTT CTGATACTG CT CTTAT C CACAT 
CGTTGC C^\AAGATGACT CTCTAAAACTAAAATTATATCAAGATGATTCAT 
TACTTGAAT CTGTTGATAAAAC CGGT CTTT ATAGTTTTAGAAATGGTGT A 
GAAAT CACT AAAGAT ATGACAGTAC CACTAGAATTTGGAGAT AAT ATTAT 
TAAGTTATCTGCTGTTGACTTATCAAATTATCGTCGTAATGAGACCCTTC 
ATAT CTATAGAAACCGTTTTGATGTTAAAG CAAG C CAAATGACAGCTGAC 
AAAGGAGCTAAAGTAACTGT GG ATATGTTGATGAAGCACTTAGTTGTTC C 
AGAAATGGCAGGAGCTTATACATTAACAAT CGACGAAGCTCCAAACACAA 
ATGAATCAGGAATGTTAACAAACG CTAAAGTAT CGATTCATT ATGTAAAT 
GGTGGTGTTGATAAAGTTG ATGTT C CG ATT AAAGTAGTTGACTTAGAAG C 
TATTCGTAAAG CTGAAGAAG CACATAAAG CTGACGAAGCACGTAA^G CTG 
AAGAAGCACGTAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAA 
GCTGAAGAAGCACATAAAGTCGAAGAAGCACCGTAAAGCTGAAGAGGGAC 
ATAAAACCCAAGAAGCACCTATAGTTGAAGAAGGCrACAAGGTTAATAAC 
GTTC^TCAAACTGATACTACAGTTAAAG CGTCTGATTTAC CAAAGACTAA 
GACAGTTTC CG CAGTTCAT ATGGCTAGAACAGACAATAAACAGATAACTT 
CACAT CAGACACATGTTG 
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msa235280.2{l95_COHl} ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

msa235280.2{l95_M732} ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

msa235280.2{l95_M78l} ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

msa235280 .2{195JH36B} TGGT GTCCAAATTT ATCAATACTA 

msa235280 .2{l95_JM9130013 } TGGT GTCCAAATTT ATCAATACTA 

msa235280.2{l95_18RS2l} ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

msa235280.2{l95_2603> ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

msa235280 . 2 { 195_A909 } ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA 

Consensus **** ********** ********** 



msa235280.2fl95_COHl> 
msa23S280.2{l95_M732} 
msa235280 . 2 { 195_M781 } 
msa235280.2{l95_H36B} 
msa235280 .2{l95_JM9130013} 
msa235280 . 2 { 195_18RS21 } 
msa235280.2f 195_2603} 
msa235280.2{195_A90 9} 
Consensus 
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TATCAAAATG 
TATCAAAATG 
TATCAAAATG 
TATCAAAATG 
TATCAAAATG 
TATCAAAATG 
TATCAAAATG 
TATCAAAATG 



GACAACAATA 
GACAACAATA 
GACAACAATA 
GACAACAATA 
GACAACAATA 
GACAACAATA 
GACAACAATA 
GACAACAATA 



********** ********** 



AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
AACCTTACTT 
********** 



AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
AAGTCCCAAA 
********** 
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GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
GATAAGACTA 
********** 



msa235280 . 2 ( 195_COHl} 
msa235280.2{195_M732} 
msa235280.2{l95_M78l} 
msa235280.2{l95_H36B} 
msa235280.2{l95_JM9130013} 
msa235280 . 2 { 195_18RS2 1 } 
msa235280.2(l95_2603} 
msa235280.2{195_A909} 
Consensus 



msa235280 . 2 f 195_COHl} 
msa235280.2fl95_M732) 
msa235280.2(195_M78l} 
msa235280.2{l95_H36B} 



101 

CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 
CTGTAGAGAA 



GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 
GTTAGAAGAT 



CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 
CGCTGGAAAA 



AAATTACTTT 
AAATTACTTT 
AAATTACTTT 
AAATTACTTT 
AAATTACTTT 
AAATTACTTT 
AAATTACTTT 
AAATTACTTT 



150 

CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 
CAAAGTTCAG 



********** ********** ********** ********** ********** 

151 200 

GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 

GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 

GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 

GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 



845 



WO 2004/018646 



Table 51: Comparative Sequences relating to SAG0677 



msa235280 .2{l95_JM9130013} 
msa235280.2{l95_18RS2l} 
msa235280 . 2 { 195_2603 } 
msa235280.2(l95_A909} 
Consensus 



GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 
GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 
GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 
GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT 
********** ********** ********** ********** ********** 



msa235280. 
msa235280. 
msa235280. 
msa235280 . 
3a235280 .2(195 
msa235280 .2{' 
msa235280 . 
msa235280 



2{l95_COHl} 
2{l95_M732} 
2{195_M781} 
2{195_H36B} 
__JM9130013} 
195_18RS2l} 
2 {195^2603} 
2{195_A909} 
Consensus 



201 

TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 
TGGTGGTGGC 



AATAATAATT 
AATAATAATT 
AATAATAATT 
AATAATAATT 
AATAATAATT 
AATAATAATT 
AATAATAATT 
AATAATAATT 



********** ********** 



TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
TAGACCTTAT 
********** 



CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
CACACCTCCA 
********** 



250 

GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
GGATTTAAAA 
********** 



msa235280.2{l95_COHl} 
msa235280 . 2 { 195_M732 } 
msa2 3 52 8 0 . 2 { 19 5__M78 1 } 
msa235280 . 2 { 195_H36B } 
msa235280.2{l95_JM9130013} 
msa235280.2{l95_18RS2l) 
msa235280.2(l95_2603} 
msa235280 . 2 { 195_A909 } 
Consensus 



251 

AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 
AAGAAGATAA 



AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 
AAAAGTTGAA 



AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 
AAACCAAAAT 



TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 
TAGACCGTCC 



300 

ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 
ACCAGGAATT 



301 350 

msa2 35280.2* 19 5J20H1 } GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2(195_M732} GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280 ,2f 195_M78lj GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2{195_H36B} GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2{!95_JM9130013} GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2(l95_18RS2l} GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2fl95_2603} GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

msa235280.2(195_A909) GATTTACCAc CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC 

Consensus *********- ********** ********** ********** ********** 

351 400 

msa235280 .2{l95_COHl} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

ins a2 3 52 80 .2{l95_M732} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280 -2{195_M781} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280.2{l95_H36B} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280.2{l95_jJM9130013} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280 .2{l95_18RS2l} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280.2{l95_2603} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

msa235280 .2{195_A909} GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT 

Consensus ********** ********** ********** ********** ********** 

401 450 

msa235280.2{l95_COHl} TCCCAGATTT AAACACGCCG CCGGATGAAG cCACcAAAGG ATAGTAAAAA 

msa235280.2{l95_M732} TCCCAGATTT AAACACGCCG CCGGATGAAG CCACcAAAGG ATAGTAAAAA 

msa235280.2(l95_M78l} TCCCAGATTT AAACACGCCG CCGGATGAAG cCACcAAAGG ATAGTAAAAA 

msa235280.2{195_H36B} TCCCAGATTT AAACACGCCG CCGGATGAAG -CACtAAAGG ATAGTAAAAA 

msa235280 .2{195_JM9130013 } TCCCAGATTT AAACACGCCG CCGGATGAAG . CACcAAAGG ATAGTAAAAA 

tnsa235280.2{l95_18RS2l} TCCCAGATTT AAACACGCCG CCGGATGAAG .CACcAAAGG ATAGTAAAAA 

msa235280-2{l95_2603} TCCCAGATTT AAACACGCCG CCGGATGAAG .CACcAAAGG ATAGTAAAAA 

msa235280.2{l95_A909} TCCCAGATTT AAACACGCCG CCGGATGAAG . CACtAAAGG ATAGTAAAAA 

Consensus ********** ********** ********** _***_***** ********** 

451 500 

msa235280.2{l95_COHl} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280.2{l95_M732} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280.2{l95_M781} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280.2{l95_H36Bj AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280.2{l95_JM9130013) AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280.2{l95_18RS2l} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa2 352 80. 2 { 195J2603} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

msa235280 -2{195_A90S} AGACGCTATT GAAGATAAAT CAGGAGCAAT TAAATATGCT AAGTCTCTTC 

Consensus ********** ********** ********** ********** ********** 



501 550 

mBa235280.2{l95_COHl> AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

ms a2 35280.2(19 5__M7 3 2 } AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa235280 .2{l95_M78l} AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa235280 -2(195_H36B} AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa235280.2(l95_JM9130013j AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa235280.2{l95_18RS21) AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa23 528 0.2 ( 195^2603} AACTTAGCTT TGTTGATGgC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

msa235280.2{l95_A909} AACTTAGCTT TGTTGATGaC CCTATTTTAG CTAGCAAAGT AAATGGCAAA 

Consensus ********** ********_* ********** ********** ********** 



846 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



551 600 

msa235280.2{l95_COHl} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa235280.2(l95_M732} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa235280.2(l95_M78l} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa2 35280.2(19 5_H3 6 B } ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa235280.2{l95_JM9130013} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

m9a2 35280.2(19 5_1 8RS21 } ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa235280.2(l95_2603} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

msa23528 0.2(l95_A909} ATATTACAAG TCGAATCTGA TGGCAAATTA GTCATTCCTA GAAATGCTTT 

Consensus ********** ********** ********** ********** ********** 

601 650 

msa2352 80.2{l95_COHl} GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa2 3 52 8 0 . 2 { 19 5_M73 2 } GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280.2(l95_M78l} GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280.2{l95_H36B} GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280.2{l95_JM9130013} GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280 . 2 { 195_18RS21 } GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280.2(l95_2603) GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

msa235280.2{l95_A909} GTCAGCTAAT CAATTTGATG ACACTAGTCT TAAAATTTAT CGTAATAATA 

Consensus ********** ********** ********** ********** ********** 

651 700 

msa235280.2(l95_COHl} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa235280.2{l95_M732} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa235280.2{l95_M781} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa23 5280 .2(19 5_H36B} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa235280.2{l95_JM9130013} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

ms a2 35 2 8 0 . 2 { 19 5_1 8 RS2 1 } ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa23 52 8 0.2 {l95_2603} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

msa235280.2(l95_A909} ATCGCAATAA AGAAATTACT ATCACAACAG ATTATTTTGC AGATACAAAA 

Consensus ********** ********** ********** ********** ********** 



msa235280.2{l95^COHl} 
msa235280.2{l95_M732} 
msa235280.2(195_M78l} 
msa235280 . 2{l95_H36B} 
msa235280 . 2( 1'95_JM9130013 j 
msa235280.2(l95_18RS21) 
msa23 5280 . 2(19 5_2603 } 
msa235280.2(l95_A909} 
Consensus 



701 

TATGTCAATA 
TATGTCAATA 
TATGTCAATA 
TATGTCAATA 
TATGTCAATA 
TATGTCAATA 
TATGTCAATA 
TATGTCAATA 



TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 
TCACAGCGGT 



TGACTATTTG 
TGACTATTTG 
TGACTATTTG 
TGACTATTTG 
TGACTATTTG 
TGACTATTTG 
TGACTATTTG 
TGACTATTTG 



AGCAATACTA 
AGCAATACTA 
AGCAATACTA 
AGCAATACTA 
AGCAATACTA 
AGCAATACTA 
AGCAATACTA 
AGCAATACTA 



750 

CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 
CTTTTGAGCA 



********** ********** ********** ********** ********** 



751 

msa23528 0.2(l95_COHl} ATTAGCTACT GGTGAAACAG 

tnsa23528 0.2(l95_M732} ATTAGCTACT GGTGAAACAG 

msa2352 80.2(195_M78l} ATTAGCTACT GGTGAAACAG 

msa2352 80.2(l95JH36B} ATTAGCTACT GGTGAAACAG 

msa235280.2{l95_JM9130013} ATTAGCTACT GGTGAAACAG 

msa235280 .2(195_18RS21) ATTAGCTACT GGTGAAACAG 

msa235280.2{l95_2603} ATTAGCTACT GGTGAAACAG 

msa235280.2(l95_A909} ATTAGCTACT GGTGAAACAG 

Consensus ********** ********** 



TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 



TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 



800 

TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 



********** ********** ********** 



801 850 

msa235280.2{l95_COHl} TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280.2f 195_M732} TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280.2(l95_M78l) TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280.2(l95_H36B) TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTcAA CGATAAATTG 

msa235280 . 2 { 195_JM913 0013 } TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280 .2 {l95_18RS2lj TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280.2{l95_2603) TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

msa235280.2(195_A909} TTGCTGCTAT TAAAGACAAG GGTGGTAAGA TTTATGTtAA CGATAAATTG 

Consensus ********** ********** ********** *******_** ********** 



851 

msa235280.2f!95_COHl} CAAGAAACTT CTCGTATAGC 

msa2352 80.2(l95_M732} CAAGAAACTT CTCGTATAGC 

msa235280.2{l95_M78l} CAAGAAACTT CTCGTATAGC 

msa235280.2(l95_H36B} CAAGAAACTT CTCGTATAGC 

msa235280.2{l95_JM9130013} CAAGAAACTT CTCGTATAGC 

msa235280.2{l95__18RS2l} CAAGAAACTT CTCGTATAGC 

msa235280.2f 195_2603) CAAGAAACTT CTCGTATAGC 

msa235280.2{195_A909} CAAGAAACTT CTCGTATAGC 

Consensus ********** ********** 



GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 
GCTTAAAGAT 



AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 
AAATCTGTTA 



********** ********** 



900 

AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
AGATTGGTAT 
********** 



msa235280.2(l95_COHl} 
msa235280.2(l95_M732} 
msa235280.2(l95_M78l} 
msa235280.2(l95_H36B) 
msa235280.2(l95__JM9130013) 



901 

TGAATTACCA AATGATGTCA 
TGAATTACCA AATGATGTCA 
TGAATTACCA AATGATGTCA 
TGAATTACCA AATGATGTCA 
TGAATTACCA AATGATGTCA 



950 

GACATATTGA TAGTTTATCT GTTCGTCGTT 
GACATATTGA TAGTTTATCT GTTCGTCGTT 
GACATATTGA TAGTTTATCT GTTCGTCGTT 
GACATATTGA TAGTTTATCT GTTCGTCGTT 
GACATATTGA TAGTTTATCT GTTCGTCGTT 



847 



WO 2004/018646 



PCT/US2003/026827 
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msa235280 .2 { 195_18RS2l} 
msa235280 .2{ 195^2603} 
msa235280 . 2 ( 195_A909 } 
Consensus 



TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 
TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 
TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 
********** ********** ********** ********** ********** 



msa235280 . 2{l95_COHl} 
msa23S280 . 2 { 195_M732 } 
msa235280 .2{195_M781} 
msa235280 . 2{ 195_H36B} 
msa235280 . 2{ 195_JM9130013 } 
msa235280.2(l95_18RS2l} 
msa235280.2{l95_2603} 
msa235280 . 2 { 195_A909 } 
Consensus 



951 

TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
TGAATGAGGT 
********** 



TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
TAAAACTGTT 
********** 



GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
********** 



TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
TGAAAAATGA 
********** 



1000 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
TGAACAAGAC 
********** 



msa2 35280.2(19 5_C0H1 } 
msa235280 . 2 { 195_M732 } 
msa235280.2f 195JM781) 
msa235280.2{195__H36B} 
msa235280 . 2 { 195_JM9130013 } 
msa235280 . 2 { 195_18RS21 } 
msa235280 . 2{ 195_2603 } 
msa235280 . 2{ 195_A909 } 
Consensus 



1001 

ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 
ATTAATCTCA 



GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 
GCAAAACTTA 



CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 
CCAATTAAAA 



********** ********** ********** 



TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
TACAACCCGA 
********** 



1050 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
CAAATCGTCG 
********** 



1051 1100 

msa235280.2{l95__COHl} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa235280.2{l95_M732} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa235280.2{ 195J4781} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa235280 .2{195„H36B} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa2 35280.2(19 5_JM91 30013} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa2 35280. 2(19 5_1 8RS2 1 } TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa235280.2(l95_2603} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

msa235280.2(l95_A909} TCTAGAGTTT ACTATTAATA ACATTAACTC AAGTTCAGAA ATCATGACCA 

Consensus ********** ********** ********** ********** ********** 

1101 1150 

msa2 3 52 8 0 . 2 ( 195_C0H1 } CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa235280.2(l95_M732} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa235280.2(l95_M78l} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa23 5280.2(l95_H36Bj CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa235280 .2(195_JM9 130013} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa2352B0.2{l95_18RS2l} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa235280.2(l95_2603} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

msa235280.2(195_A909} CTTTCAAAGA TGGAAAGATG CCAGAATTGG TTGAACAAAA AGATGTTTCT 

Consensus ********** ********** ********** ********** ********** 



msa2 3 52 8 0 . 2 ( 195_C0H1 J 
msa235280.2(l95_M732} 
msa235280.2(l95_M78l} 
msa235280 . 2 { 195_H36B} 
msa235280 -2{l95_JM9130013) 
msa235280 . 2 ( 195_18RS2l} 
msa235280 .2{195_2603} 
msa235280.2(l95_A909} 
Consensus 



1151 

TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 



ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 



********** ********** 



CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
********** 



TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
********** 



1200 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
********** 



1201 1250 

msa235280.2(l95_COHl} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa235280.2f 195_M732} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa2 35280. 2(19 5_M7 81} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa235280.2(l95_H36B} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa2 35280. 2(19 5_JM9 130013} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa2 35280. 2(19 5_1 8 RS2 1 } ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa235280.2(l95_2603} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

msa235280 .2{195_A909} ACGAAAGGAT TCTGAATTTA AGGGACAACT TATTGCAAAA ACTGGAACAG 

Consensus ********** ********** ********** ********** ********** 

1251 1300 

msa235280 .2(l95_COHl} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280 . 2? 195_M732) TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280 .2{195_M781} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa2 35280.2(19 5_H36B } TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280.2{l95_JM9130013} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280 .2{l95_18RS2l} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280 .2{l95_2603} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

msa235280.2(l95_A909} TTGAATTAGA TATGTTTTTC AAACAATCTC AAGACCCAGC TTCAATTATT 

Consensus ********** ********** ********** ********** ********** 



1301 



1350 



848 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235280.2fl95_COHl) AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa2 35280.2(19 5_M73 2 } AAAAAAATAT ACCTTATCCA -AAATGGTGTT CCAAATGAAT TGAAAAAATT 

tnsa2 35280.2(19 5_M7 8 1 } AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa23528 0.2(l95_H36B} AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa235280.2{l9 5_JM9 130013} AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa235280.2(l95_18RS2l} AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa235280.2(l95_2603} AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

msa235280.2(l95__A909} AAAAAAATAT ACCTTATCCA AAATGGTGTT CCAAATGAAT TGAAAAAATT 

Consensus ********** ********** ********** ********** ********** 



, 1351 1400 

msa23528 0.2(l95_COHl} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280 .2(195_M732} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280 .2{195_M78l} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa2352 80 .2(195_H36B} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280 . 2{l95_JM9130013 } TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280 . 2 { 195_18RS21 } TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280.2f 195_2603} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

msa235280.2(195_A909} TGACTCTAGT TTTGGTTTAA CTGAAAGTCA GATAGATGGA TACTATATTT 

Consensus ********** ********** ********** ********** ********** 



msa235280.2(l95_COHl} ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280.2(l95_M732> ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280.2{l95_M781) ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa2 35280.2(19 5_H3 6B } ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280 . 2{195_JM9130013 } ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280.2{l95_18RS2lj ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280.2(l95_2603} ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

msa235280.2(l95_A909} ATAAAGATGC AATTAACCTT AAATTTAAAT TAACCAGTGG TGCAAGTCTT 

Consensus ********** ********** ********** ********** ********** 



msa235280.2(l95_COHl} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2(l95_M732} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2(l95_M78lJ AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2(l95_H36B} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2(l95_JM9130013} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280 .2{l95_18RS2lJ AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2(195_2803} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

msa235280.2{l95_A909} AAAGTTGTTT ATAAAGGGCA AGAAGATCCA TATAGTCATC AGAAAGAAGA 

Consensus ********** ********** ********** ********** ********** 



msa23 52 8 0 . 2 { 195_COHl } TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2(l95_M732} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2(l95_M78l) TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2{l95_H36B} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2{l95_JM9130013} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa23528 0.2(l95_JL8RS2l} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2{l95_2603} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

msa235280.2{l95_A909} TATGACTAAA AaAGGTGAAC AGCTCAGTCA TTCAACTCAA GCCAATGAAA 

Consensus ********** *_******** ********** ********** ********** 



msa23528 0.2(l95_COHl} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa235280.2{l95_M732} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa2352 80.2(l95_M78l} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa235280.2(l95_H36B} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa23 5280 .2{l95_JM9130 013} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa2 3 52 8 0 . 2 ( 195_18RS2 1 } ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

msa23528 0.2{l95_2603} ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

ms a2 3 52 8 0 . 2 ( 19 5_A9 0 9 } ATACAGCAAA AGTAACCTTT GCTAATATTG ACTGGTCACA TTATAGTAAG 

Consensus ********** ********** ********** ********** ********** 



msa235280 . 2{l95_COHl} GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

msa235280.2(l95_M732} GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

msa23528 0.2(195_M78l} GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

msa2 35280. 2 { 195_H3 6B ) GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

msa23 5280.2{l95_JM9130013} GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

ms a2 3 5 2,8 0 . 2 { 1 9 5_1 8RS2 1 } GTTACTGTGA ATGGAAAAGA AGTTGtTAAA GGTAGTGAGT TACCTTTAAC 

msa2 35280. 2(19 5_2 603} GTTACTGTGA ATGGAAAAGA AGTTGtTAAA GGTAGTGAGT TACCTTTAAC 

msa235280.2(l95_A909} GTTACTGTGA ATGGAAAAGA AGTTGgTAAA GGTAGTGAGT TACCTTTAAC 

Consensus ********** ********** *****_**** ********** ********** 



. . 1651 1700 

tnsa235280.2f 195_COHl| TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

msa235280 . 2{195_M732 } TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

msa235280.2(l95_M78l} TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

msa235280.2{l95_H36B) TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

msa235280 -2{l95_JM9130013} TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

msa235280.2(l95_18RS2l} TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 



849 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa23528 0.2(l95_2603l 
msa235280 .2{195_A909} 
Consensus 



TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 
TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 
********** ********** ********** ********** ********** 



1701 1750 

msa235280 .2{l95_COHl} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa235280 . 2 {195_M732 } TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa235280.2{l95_M78l} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa235280 .2{195_H36B> TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa235280.2{l95_JM913 0013} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa2352 80.2(l95_18RS2l} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa23 5280 ,2{195_2603} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

msa23 5280 .2(19 5_A909} TTAAAAGTTT GATTATGGAG ACGGGTAGTG TAAGTAAGAA AGTTCAACAA 

Consensus ********** ********** ********** ********** ********** 

1751 1800 

msa235280.2{l95_COHl} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa235280.2(l95_M732} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa23 5280.2(l95_M78l} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa235280.2{l95_H36B} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

tnsa235280 .2{195_JM9130013} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa235280.2(l95_18RS2l} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa235280 .2{195_2603} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

msa235280.2{l95_A909} CTTCCTTTAA GTCCTAGATT ATCTAAAAAT AAGCATATGA GGGATATGCT 

Consensus ********** ********** ********** ********** ********** 



1801 

msa235280.2(l95_COHl} ACTTACTATG 

msa235280 .2{195_M732} ACTTACTATG 

msa235280 .2(l95_M7Bl} ACTTACTATG 

msa235280 . 2 { 195__H36B } ACTTACTATG 

msa235280 .2{195_JM9130013} ACTTACTATG 

msa235280.2{l95_18RS2l} ACTTACTATG 

msa235280.2{l95_2603} ACTTACTATG 

msa2 35280 .2{l95_A909} ACTTACTATG 

Consensus ********** 



CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 
CAAAAAGATT 



CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 
CAGCGTATTA 



CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 
CGAAACAAGT 



********** ********** ********** 



1850 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
GACAGTCTAG 
********** 



1851 1900 

msa235280.2f 195_COHl} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa235280.2(195_M732} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

ms a2 35280.2(19 5_M7 8 1 } TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa235280 . 2{195_H36B) TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa23 52 8 0 . 2 { 19 5_JM9 130013} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa235280.2{l95_18RS2l} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa235280.2{l95_2603} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

msa235280 . 2{l95_A909} TCCTTCGAAT TAATCTCACT GCAGATACTA AACTTAATTT TAATGCTGTT 

Consensus ********** ********** ********** ********** ********** 

1901 1950 

msa235280 . 2 { 195_C0H1 } AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

maa2 35280 . 2 { 195_M732 } AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280 .2{l95_M78l] AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280.2{l95_H36B} AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280.2{l95_tTM9130013} AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280.2{l95_18RS2l} AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280.2{l95_2603} AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

msa235280.2{!95_A909} AAAGGAGCGA GTGCTCTTAC TGAAAATATG ATGATGAGAC AGTTTGCAGT 

Consensus ********** ********** ********** ********** ********** 

1951 2000 

msa2 35280.2(19 5_C0H1 } TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa235280.2(l95_M732} TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa2 35280. 2(19 5_M7 8 1 J TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa235280 -2{195_H36B} TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa235280 . 2{l95_JM9130013 } TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa235280.2(l95_18RS2l) TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa2 35280. 2(19 5_2 603} TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

msa2352B0.2(l95_A909} TGCTGGACCA CAAGATGATC CTGTTAGTGA ACATAAATAC CCATCAGTAT 

Consensus ********** /********** *********** ********** ********** 



msa2 35280. 2(19 5_C0H1 } 
msa235280 . 2 ( 195_M732 } 
msa235280.2{l95_M78i; 
msa235280 .2{195_H36B] 
msa235280.2{l95_JM9130013; 
rasa235280.2{l95_18RS21. 
msa235280 . 2 ( 195__2603 
msa23S280 . 2 ( 195_A909 } 
Consensus 



2001 

TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
********** 



TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
********** 



TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
TTGGAAACTG 
********** 



CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
********** 



2050 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
AACTCTAAAT 
********** 



msa235280 . 2 { 195_C0H1 } 



2051 2100 
GGTAAGGAAA TCACAGCATC TGGTATTATC GGTCACATCA AGGATGGTGA 



850 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa23 5280 .2( 195_M732} 
msa235280 . 2 { 195_M781 } 
msa235280 .2{l95_H36B} 
msa2352 80 . 2 { 195_JM9130013 } 
msa235280.2{l95_18RS21> 
msa235280 . 2 { 195_2603 } 
msa235280.2{l95_A909} 
Consensus 



msa235280 .2{195_C0H1} 
msa235280 . 2{l95_M732} 
msa235280 ,2{195_M781} 
msa235280 .2{195_H36B} 
msa235280.2{l95_JM9130013} 
msa235280.2{l95_JL8RS2l} 
msa235280 .2{l95_2603] 
msa235280 .2{l95_A909} 
Consensus 



msa235280 . 2 { 195_C0H1 } 
msa235280 .2{195_M732} 
msa235280 .2(195_M781} 
msa235280.2{195_H36B} 
msa235280.2{l95_JM9130013} 
msa235280 . 2 { 195_18RS21 } 
msa235280.2{l95_2603} 
msa235280 . 2 { 195_A909} 
Consensus 



GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
********** 

2101 

TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 



TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
********** 



TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 



GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 



AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 



********** ********** ********** 



CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 



TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 



GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 



2150 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 
GGAGACATGC 



********** ********** ********** ********** ********** 



2151 

TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
TAGGAACCCC 
********** 



TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
********** 



CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 
CAAGGTAAAG 



ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 



2200 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 



********** ********** ********** 



msa235280.2{l95_COHl} 
msa235280 . 2 { 195_M732 } 
msa235280 .2{195_M781} 
msa235280 .2{195JH36B> 
msa235280.2{l95_CTM9130013} 
msa235280 . 2 { 195_18RS21 } 
msa235280.2{l95_2603j 
msa235280 ,2{195_A909} 
Consensus 



msa235280.2{l95_COHl} 
msa235280.2{l95_M732} 
msa235280 . 2 {l95_M78l} 
msa235280.2{l95_H36B} 
msa235280 . 2 { 195_JM9130013 } 
msa235280 . 2 { 195JL8RS21 } 
msa235280.2{l95_2603} 
msa235280.2{l95_A909} 
Consensus 



rasa235280 . 2 { 195_COHl } 
msa235280 . 2 { 195_M732 } 
msa235280 . 2 { 195_M781 } 
msa235280 .2{195_H36B} 
msa235280.2{l95_JM9130013} 
msa235280.2{l95_18RS2l} 
msa235280.2{l95__2603} 
msa235280 . 2 { 195_A909 } 
Consensus 



ms a2 3 52 8 0 . 2 { 19 5_COHl } 
msa235280 . 2 { 195_M732 } 
msa235280 . 2 { 195_M781 } 
msa235280.2{l95_H36B) 
msa235280.2{l95_JM9130013) 
msa235280.2{l95_18RS21} 
msa235280 . 2 { 195_2603 } 
msa235280.2{l95__A909} 
Consensus 



2201 

CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
CCATTAATGA GTGGACGTAG 
********** ********** 

2251 

CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
********** 

2301 

TGGTAACAGA AG CAGG AG AG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
TGGTAACAGA AGCAGGAGAG 
********** ********** 



AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 



GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 



********** ********** 



TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 



GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
********** ********** 



AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
********** 



CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
CACTTGGATT 
********** 



TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
********** 



2351 

GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
GACCAATCAG 
********** 



TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
********** 



TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 



GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
********** ********** 



2250 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
********** 

2300 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
AGGGTTGAAG 
********** 

2350 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
********** 

2400 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
********** 



2401 2450 

msa235280.2{l95__COHl} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

msa23528 0.2{l95_M73 2} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

msa23528 0.2{l95_M781} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

maa235280 .2{l95_H36B} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

msa235280.2{l95_JM9130013} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

msa235280.2{l95 18RS21} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 

maa235280 .2(195 2603} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 



851 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235280 .2{195_A909} TTCTGATACT GCTCTTATCC ACATCGTTGC CAAAGATGAC TCTCTAAAAC 
Consensus ********** ********** ********** ********** ********** 



2451 

msa235280 .2{l95_COHl) TAAAATTATA TCAAGATGAT TCATTACTTG 

msa2352 80.2{l95_M732) TAAAATTATA TCAAGATGAT TCATTACTTG 

tnsa23S280 .2{l95_M78l} TAAAATTATA TCAAGATGAT TCATTACTTG 

msa23 52 8 0 . 2 { 195_H3 6B } TAAAATTATA TCAAGATGAT TCATTACTTG 

msa235280 . 2 { 195_JM9130013 } TAAAATTATA TCAAGATGAT TCATTACTTG 

msa235280.2{l95_18RS2l} TAAAATTATA TCAAGATGAT TCATTACTTG 

msa2 3 5 2 8 0 . 2 { 1 9 5_2 603} TAAAATTATA TCAAGATGAT TCATTACTTG 

msa235280 .2{195_A909} TAAAATTATA TCAAGATGAT TCATTACTTG 

Consensus ********** ********** ********** 



AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 



2500 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 



********** ********** 



2501 2550 

msa235280 . 2 f 195_COHl} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa235280.2{195_M732} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa2352 80.2{l95_M78l} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa235280 .2{195_H36B} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa235280.2{l95_JM9130013} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa23528 0,2{l95_18RS2l} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa235280 . 2{l95_2603} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

msa235280.2{l95_A909} CTTTATAGTT TTAGAAATGG TGTAGAAATC ACTAAAGATA TGACAGTACC 

Consensus ********** ********** ********** ********** ********** 



2551 

msa235280 . 2{195_C0H1} ACTAGAATTT GGAGATAATA 

msa235280 . 2{l95_M732 } ACTAGAATTT GGAGATAATA 

msa235280 . 2{195_M78l} ACTAGAATTT GGAGATAATA 

msa235280 .2{195_H36B} ACTAGAATTT GGAGATAATA 

msa235280.2{l95_JM9130013} ACTAGAATTT GGAGATAATA 

msa235280 . 2 { 195_18RS21 J ACTAGAATTT GGAGATAATA 

msa235280.2{l95_2603} ACTAGAATTT GGAGATAATA 

msa235280 -2{195_A90 9} ACTAGAATTT GGAGATAATA 

Consensus ********** ********** 



TTAtTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 
TTAcTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 



ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 
ATCTGCTGTT 



***_****** ********** 



2600 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
********** 



2601 

msa235280 . 2 { 195_C0H1 } ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280.2f 195__M732) ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280.2{l95_M781} ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280 . 2{l95_H36B} ATTATCGTCG TAATGAGACC CTTCATATCT 

msa23 5280 . 2 { 195_JM9130013 } ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280 . 2 { 195_18RS2l} ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280.2{l95_2603) ATTATCGTCG TAATGAGACC CTTCATATCT 

msa235280.2{l95_A909} ATTATCGTCG TAATGAGACC CTTCATATCT 

Consensus ********** ********** ********** 



ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 



2650 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 



********** ********** 



2651 2700 

msa235280 .2{l95_COHl} AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280.2{l95_M73 2} AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280 . 2{l95_M78l) AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280 .2{l95_H36B) AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280 . 2 { 195_JM9130013 } AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa2 35280 .2 { 195_18RS2l} AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280.2fl95_2 603> AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

msa235280 .2{195_A909) AAAGCAAGCC AAATGACAGC TGACAAAGGA GCTAAAGTAA CTGTGGATAT 

Consensus ********** ********** ********** ********** ********** 

2701 2750 

msa235280.2{l95_COHl} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa235280.2{l95_M732} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa235280.2{l95_M78l} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa2352 80.2{l95_H36B} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa235280.2{l95_JM9130013} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa235280 .2 {195_18RS21) GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa235280.2{l95_2603) GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

msa2352 80 .2{195__A909} GTTGATGAAG CACTTAGTTG TTCCAGAAAT GGCAGGAGCT TATACATTAA 

Consensus ********** ********** ********** ********** ********** 

2751 2800 

msa235280.2{l95_COHl} CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

ms a2 35280 .2{195_M732} CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280.2fl95__M78l} CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280.2{195_H36B> CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280 . 2 { 195_JM9130013 } CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280.2{l95_18RS2l} CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280.2{l95_2603} CAATCGACGA AGcTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

msa235280.2{l95_A909} CAATCGACGA AGaTCCAAAC ACAAATGAAT CAGGAATGTT AACAAACGCT 

Consensus ********** **_******* ********** ********** ********** 



ms a2 3 52 8 0 . 2 { 1 9 5_COHl } 
msa235280.2{l95_M732} 



2801 2850 
AAAGTATCGA TTCATTATGT AAATGGTGGT GTTGATAAAG TTGATGTTCC 
AAAGTATCGA TTCATTATGT AAATGGTGGT GTTGATAAAG TTGATGTTCC 



852 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235280 . 2{ 195__M78l} 
msa23528 0 . 2{ 195__H36B) 
msa235280 . 2 { 195_JM9130013 } 
msa235280 . 2 { 195_18RS21 } 
msa235280 .2fl95_2603} 
msa235280 .2{195_A909} 
Consensus 



AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 



TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 



AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 



GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 



********** ********** ********** ********** 



TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
********** 



2851 

msa23 52 8 0 . 2 { 195_C0H1 } GATTAAAGTA 

msa235280 .2{195_M732} GATTAAAGTA 

msa235280.2{l95_M78l) GATTAAAGTA 

msa2 35280. 2 { 195_H3 6B } GATTAAAGTA 

msa2 35280.2(19 5_JM9 13 0013} GATTAAAGTA 

msa235280.2{l95_18RS2l} GATTAAAGTA 

msa235280.2f 1 95^2603} GATTAAAGTA 

msa235280.2{195_A909} GATTAAAGTA 

Consensus ********** 



2900 

GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 
GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 
GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 
GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 
GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 

GTTGACTTAG AAGCTATT 

GTTGACTTAG AAGCTATT 

GTTGACTTAG AAGCTATTcg taaagctgaa gaagcacata 
********** ********__ 



2901 2950 

msa2 35280 .2(19 5_COHl| aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

msa235280 . 2 { 195__M732 } aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

msa235280 .2{l95_M78l} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

msa235280 .2{195_H36B) aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAcGAAGCA 

msa235280 . 2 { 195_JM9130013 } aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

msa235280 .2{l95_18RS2l} CGTAAAGC TGAaGAAGCA 

msa235280 . 2{l95_2603} . .' cgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

msa235280.2{l95_A909} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC TGAaGAAGCA 

Consensus • ******** *** _****** 



msa235280 . 2(l95_COHl} 
msa23 52 8 0 . 2 ( 195J4732 } 
msa235280.2(l95_M78l] 
msa235280 .2{195_H36B] 
rasa235280 . 2 { 195_JM9130013 ] 
msa235280 . 2 { 195_18RS2i; 
msa235280.2(l95_2603 
msa235280.2{l95_A909j 
Consensus 



2951 

CaTAAAGCTG 
CaTAAAGCTG 
CaTAAAGCTG 
CaTAAAGCTG 
CaTAAAGCTG 
CaTAAAGCTG 
CgTAAAGCTG 
CgTAAAGCTG 
*-******** 



3000 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 
AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 
AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 
AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 
AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGcA 

AAGAAGcA 

AAGAAGcA 

***★**_*__ ____ 



3001 3O50 

msa2 35280.2(19 5_C0H1 } agca . CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa235280.2(l95_M732) agca. CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa2 35280. 2(19 5_M7 8 1 } agcacCGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa2 35280. 2(19 5_H3 6B } agca . CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa235280.2{l95_JM9130013) agcacCGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa235280.2(l95_18RS21} CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa235280.2{l95 2603} CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

msa2 35280. 2(19 5_A9 09} CGTAA AGCTGAAGAG GGACATAAAA CCCAAGAAGC ACCTATAGTT 

Consensus ***** ********** ********** ********** ********** 



3051 

msa2 3 52 8 0 . 2 { 1 95_COHl } GAAGAAGGCT 

msa235280.2(l95_M732) GAAGAAGGCT 

msa235280.2{195_M781} GAAGAAGGCT 

msa235280.2(l95_H36B} GAAGAAGGCT 

msa235280.2(l95_JM9130013} GAAGAAGGCT 

msa235280.2(l95_18RS2l) GAAGAAGGCT 

msa235280.2(l95_2603} GAAGAAGGCT 

msa235280 .2{195_A909} GAAGAAGGCT 

Consensus ********** 



ACAAaGTTAA 
ACAAaGTTAA 
ACAAaGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
****_***** 



TAACGTTCAT 
TAACGTT CAT 
TAACGTTCAT 
TAACGTTCAT 
TAACGTTCAT 
TAACGTTCAT 
TAACGTTCAT 
TAACGTTCAT 
********** 



CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
********** 



3100 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
********** 



3101 

msa2 3 52 8 0 . 2 ( 195_C0H1 } AGCGTCTGAT 

msa2 35280.2(19 5_M73 2 } AGCGTCTGAT 

msa235280.2(195_M78lJ AGCGTCTGAT 

msa235280.2(l95_H36B} AGCGTCTGAT 

msa235280 . 2{l95_JM9130013 } AGCGTCTGAT 

msa235280.2(l95_18RS2l) AGCGTCTGAT 

msa235280 .2(195_2603} AGCGTCTGAT 

msa235280.2(l95_A909} AGCGTCTGAT 

Consensus ********** 

3151 

msa235280.2(l95_COHl} GAACAGACAA 

msa235280.2(l95_M732} GAACAGACAA 

msa235280.2fl95_M78l} GAACAGACAA 

msa235280.2(195_H36B) GAACAGACAA 

msa235280.2{l95 JM9130013) GAACAGACAA 

msa235280 . 2 {l95_18RS2l} GAACAGACAA 

msa235280.2fl95_2603j GAACAGACAA 

msa23528 0.2{195_A909} GAACAGACAA 



TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
********** 



TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 



CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
CTAAGACAGT 
********** 



ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 



TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
********** 



3150 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
CATATGGCTA 
********** 



3200 

AGACACATG t 

AGACACATGt TGAAAA 

AGACACATG t TG 

AGACACATG- 

AGACACATGt TG ~ 

AGACACATGt TGAA 

AGACACATGt TGAAAAACAA 
AGACACATGt TGAAAAACAA 



853 



WO 2004/018646 



PCT/US2003/026827 
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Consensus ********** ********** ********** *********.. ********** 

3201 3250 

msa235280 .2{l95_COHl} 

msa235280.2{l95_M732} ~ »- 

msa235280.2{l95_M78l} '■ 

msa235280.2{l95_H36B} 

msa235280.2{l95_JM9130013} 

msa235280.2{l95_JL8RS2l} 

msa235280 .2{l95_2603} ATTAAAAATA cattgccatc cactggtgac agcaaacgtg gttattatat 

msa235280 .2{195_A909} ATTAAAAATA 

Consensus ********** ********** ********** ********** ********** 

3251 3300 

msa235280 .2{l95_COHl} ' 

msa235280.2f 195_M732} 

msa235280.2{195_M78l} 

msa235280.2{l95_H36B} 

msa23 5280 .2{l95_JM913 0013} 

msa235280 .2 (195_18RS21 j 

msa235280 .2{l95_2603 } cactggaatg gctatcgtta tgctgagtgt attatttagt ttagctaaaa 

msa235280 .2{l95_A909} - 

Consensus ********** ********** ********** ********** ********** 

3301 3317 

ms a2 3 5 2 8 0 . 2 { 19 5J20H1 } 

msa235280.2f 195_M732} 

msa235280.2fl95_M78l} 

msa235280 . 2{195_H36B j 

msa2 35280 . 2 { 195_JM913 0013 } 

msa235280.2{l95_18RS2l} 

msa235280 .2{195_2603> agtttaaaag caaatat 

msa235280.2{l95_A909} 

Consensus ***************** 

SEQ ID NO. 5110 
STRAIN 2603 frame: 1 

LNNKGVGGDGVQ I YQ YY I KMDNNKP YL S P KD KTT VE KLE D RWKK I TFKVQDTGI GliKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPT8MRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEAPKDSKKDAIEDKSGAI KYAKSLQLSFVDGP I LASKV 
NGKI LQVESDGKLVI PRNALSANQFDDTS LKI YRNNNRNKEI T I TTD YFADTKYVNI TAV 
DYLSNTTFEQIATGETVD YHAI VFS SFAAI KDKGGKI YVNDKLQETS RI ALKDKS VKI G I 
ELPNDVRHIDSLSVRRLNEVKTVDNILKNDEQDINLSKTYQLKYNPTNRRLEFTINNINS 
SSEI>TITFKDGKMPELVEQKDVSLDINDMDMSKFKTIRI,GRKDSEFKGQLIAKTGTVELD 
MFFKQSQDPAS 1 1 KKI YL I QNGVPNELKKFDS S FGLTESQI DGYYIYKDAINLKFKLTSG 
ASLKVVYKGQEDPYSHQKEDMTKKGEQLSHSTQANENTAKVTFANIDW^ 
WKGSELPLTKGWTTFVLHKTENSLNVKSLIMETGSVSKKVQQLPLSPRLSKNKHMRDML 
LTMQKD S AYYET SD SLVLR I NLTADTKLNFNAVKGAS ALT ENMMMRQ FAVAGPQDD P VS E 
HKYPSVFLLTPALLETASEATLNGKE I TASGI IGHI KLX3DKSKHVEVKMVNENGDMLGTP 
VI I QGKDLTNRTKPLMSGRRVL YAGKQYE FRAKLPLSRFNTW I RVEWTEAGEKAS I VRR 
MFFDQSVPELNTAVAKRDLTSDTAL IH I VAKDDSLKLKL YQDDSLLES VDKTGLYS FRNG 
VE I TKDMTVPLE FGDNI I KLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDM 
LMKHLVVPEMAGAYTLTIDEAPNTNESGMLTNAKVSIHYVN^ 

KAEEARKAEEARKAEEARKAEEGHICrQEAP I VEEG YKVNNVHQTDTTVKASDLPKTKTVS 

AVHMARTDNKQITSHQTHVEKQIKOT^ 

Y 

SEQ ID NO- 5111 

STRAIN A909 frame: 1 

LNNKG VGGDGVQ I YQ YY I KMDNNKP YL S P KDKTTVE KLEDRW KKI T F KVQDTG I GL KD VY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPPPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEALKDSKKDAIEDKSGAIKYAKSLQLSFVDDPILASKV 
NGKI LQVESDGKLVI PRNALSANQFDDTS LKI YRNNNRNKE I T I TTD YFADTKYVNI TAV 
DYLSNTTFEQLATGETVD YHAI VFSSFAAIKDKGGKIYVNDKLQETSRIALKDKSVKIGI 
ELPND VRH I DSLS VRRLNE VKTVDN I LKNDEQD I NLS KTYQLKYNPTNRRLEFT I NN INS 
SSEIMTTFKDGKMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSEFKGQLIAKTGTVELD 
MFFKQSQDPASI IKKIYLIQNGVPNELKKFDSSFGLTESQIDGYYIYKDAINLKFKLTSG 
ASLKVVYKGQEDPYSHQKEDMTKKGEQLSHSTQANENTAKVTFANIDWSH^ 
VGKGSELPLTKGWTTFVLHKTENSLWKSLIMETGSVSKKVQQLPLSPRLSKNKHMRDML 
LTMQ KDS AYY ET SD S LVLR I NLTADT KLN FNAVKGAS ALTENMMMRQ FAVAG P QDD P VS E 
HKYPSVFLLTPALLETASEATLNGKE I TASGI IGHI KDGDKS KHVEVKMVNENGDMLGTP 
VI I QG KDLTNRTKPLMSGRRVL YAGKQYE FRAKLPLSRFNTW I RVEVVTEAGEKAS I VRR 
MFFDQSVPELNTAVAKRDLTSDTAL I H I VAKDDSLKLKL YQDDSLLE SVDKTGLYSFRNG 
VE I TKDMTVPLE FGDNI I KLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDM 
LMKHLWPEMAGAYTLTI DEDPNTNESGMLTNAKVS I HYVNGGVDKVDVP I KWDLEAIR 
KAEEAHKADEARKAEEARKAEEARKAEEARKAEEGHKTQEAP I VEEG YKVNNVHQTDTTV 
KASDLPKTKTVSAVHMARTDNKQ I TSHQTHVE KQ I KN 

SEQ ID NO. 5112 

STRAIN H36B frame: 2 

GVQIYQYY I KMDNNKPYLS PKX>KTTVEKLEDRWKKITFKVQDTGI GLKDVYLQS VKYVGG 
GNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTKPSKPKDSLS 
TPPGFPDLNTPPDEALKDSKKDAIEDKSGAIKYAKSLQLSFVDDPILASKVNGKILQVES 
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DGKX.VIPRNALSANQFDDTSLKIYRNNNRNKEITITTDYFADTKYVNITAVDYLSNT^ 
QLATGETVDYHAI VFSS FAAI KDKGGKI YVNDKLQETSRI ALKDKS VKI G I ELPNDVRH I 
DSLS VRRLNEVKTVDNI LKNDEQD INLSKTYQLKYNPTNRRLEFT INNI NSSSE I MTTFK 
DGKMPELVEQKDVSLDINDMDMSKFKT I RLGRKDSEFKGQLI AKTGTVELDMFFKQSQDP 
AS 1 1 KKI YLI QNGVPNELKKFDSSFGLTESQIDGYY I YKDAI NLKFKLTSGASLKWYKG 
QEDPYSHQKEDMTKKGEQLSHSTQANENTAK^FANIDWSHYSKVTVNGKEVGKGSELPL 
TKGWTTFVLHKTENS LNVKS L IMETGS VS KKVQQLPLS PRLS KNKHMRDMLLTMQKDS AY 
YETSDSLVLRINLTADTKLNFNAVKGASALTENMMMRQFAVAGPQDDPVSEHKYPSVFLL 
TPALLETAS EATLNGKE I TASG IIGHI KDGDKSKHVEVKMVNENGDMLGTPVI I QGKDLT 
NRTKPLMSGRRVLYAGKQYE FRAKL PLSRFNT WI RVEWTEAGEKAS IVRRMFFDQSVPE 
LNTAVAKRDLTS DTAL I H I VAKDDSLKLKLYQDD SLLES VDKTGL YS FRNGVE I TKDMT V 
PIiE FGDN I TKLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAK\nTVDMLMKHLVVPE 
MAGAYTLT I DEAPNTNE SGMLTNAKVS I HYVNGG VDKVD VP I KWDLEAI RKAEEAHKAD 
EARKAEEARKADEAHKAEEVRKAEEAHKVEEARKAEEGHKTQEAP I VEEGYKVNNVHQTD 
TTVKASDLPKTKTVSAVHMARTDNKQITSHQTH 

SEQ XD NO. 5113 

STRAIN 18RS21 frame: 1 

LNNKG VGGDG VQ I YQ YY I KMDNNKP YLS PKDKTTVEKLEDRWKKI TFKVQDTG I GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEAPKDSKKDAI EDKSGAI KYAKSLQLSFVDDPI LASKV • 
NGKI LQVESDGKLVI PRNALSANQFDDTSLKI YRNNNRNKE I T I TTDYFADTKYVNITAV 
DYLSNTTFEQIATGETVDYHAIVFSSFAAI KDKGGKI YVNDKLQETSRI ALKDKS VKIGI 
ELPNDVRHIDSLSVRRLNEVKTVDNILK^EQDINLSKTYQLKYNPTNRRLEFTINNINS 
SSE IMTTFKDGKMPELVEQKDVSLD INDMDMSKFKTI RLGRKDSEFKGQLI AKTGTVELD 
MFFKQSQDPASIIKKIYLIQNGVPNELKKFDSSFGLTESQIDGYYIYKDAINLKFKLTSG 
ASLKVWKGQEDPYSHQKED>TrKKGEQLSHSTQANENTAKVTFANIDWSHYSKVTVNGKE 
VVTOSSELPLTKGWTTFVLHKTENSLNVKSLIMETGSV^ 

LTMQ KD S AYYET SD S LVLR I NLTADTKLNFNAVKGAS ALTENMMM RQ FAVAG PQDDP VS E 
HKYPS VFLLTPALLETAS EATLNGKE ITASG 1 1 GH I KDGD KS KHVEVKMVNENGDMLGT P 
VI ICX3KDLTNRTKPLMSGRRVLYAGKQYEFRAKLPLSRFNTWIRVEVVTFA 
MFFDQS VPELNTAVAKRDLTSDTAL I H I VAKDDSLKLKLYQDDSLLES VDKTGL YS FRNG 
VEITKDMTVPLEFGDNI I KLSAVDL SNYRRNETLH I YRNRFD VKASQMTADKGAKVTVDM 
LMKHLWPEMAGAYTLT I DEAPNTNESGMLTNAKVS I HYVNGG VDKVD VP I KWDLEAI R 
KAEEARKAEEARKAEEGHKTQEAP I VEEGYKVNNVHQTDTTVKASDLPKTKT VS AVHMAR 
TDNKQITSHQTHVE 

SEQ XD NO. 5114 

STRAIN M732 frame: 1 

LNNKG VGGDG VQ I YQYY I KMDNNKP YLS PKDKTTVEKLEDRWKKI TFKVQDTG I GLKDVY 
LQS VKYVGGGNNNLDLI TP PGFKKEDKKVE KPKLDRPPG I DLPAPTSMRS FD YST PPGTK 
PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . I C . VSST . LC . . PYFS . QS 
KWQNITSRI . WQISHS . KCFVS . SI . . H . S .NLS . . . SQ . RNYYHNRLFCRYKI CQYHSG 
. LFEQYYF . AI SYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . I ARNFS YSA . R . I C . DWY 
.ITK.CQTY. .FICSSFE.G.NC. .YLEK. . TRH . SQQNLPI KI QPDKSSSRVYY . .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . IR 
YVFQTISRPSFNY . KNIPYPKWCSK. IEKI .L . FWFN . KSDRWILYL . RCN .P.I. INQW 
CKS . SCL . RARRSI . SSERRYD . KR . TAQSFNSSQ . KYSKSNLC . Y . LVTL . . GYCEWKR 
SW.R. . VTFN . RMDNICIT . NRKFIKC . KFDYGDG . CK. ESSTTSFKS .II- K-AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCSCWTTR . SC . . 
T . IPISISLNSCLIGNC . .GNSKW . GNHS I WYYRSHQGW . . KQAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKTINEWT . STLCR . TI . VPG . ITT . SF . HLD . G . SGNRSRRESKYCSS 
HVL . PISSRA.HSSC .T . FDF . YCS YPHRCQR . LSKTKI ISR . FIT . IC . . NRSL . F . KW 
CRNH . RYDSTTRI WR . YY . VI CC . LIKLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK. IRNVNKR . SIDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST - S . RST . S .RST . S .RST . S . RST . S . RST . SRRST . S . RGT . NPRSTYS . RRL 
QS . . RSSN . YYS . SV . FTKD . DSFRSSYG - NRQ . TDNFTSDTC . K 

SEQ XD NO. 5115 

STRAIN COH1 frame: 1 

LNNKGVGGDGVQ I YQYY I KMDNNKP YLS PKDKTTVEKLEDRWKKI TFKVQDTG I GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . I C - VSST -LC. . PYFS . QS 
KWQNITSRI. WQISHS. KCFVS. S I.. H.S. NLS. . . SQ .RNYYHNRLFCRYKI CQYHSG 
. LFEQYYF . AI SYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . I ARNFS YSA . R . I C . DWY 
.ITK.CQTY. .FICSSFE.G.NC. .YLEK. .TRH. SQQNLPI KI QPDKSSSRVYY. .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . IR 
YVFQTISRPSFNY. KNIPYPKWCSK. IEKI . L . FWFN . KSDRWILYL . RCN . P . I . INQW 
CKS . SCL . RARRS I . SSERRYD . KR . TAQS FNSSQ . KYSKSNLC . Y . LVTL . . GYCEWKR 
SW.R. . VTFN . RMDNICIT . NRKFI KC . KFDYGDG . CK . ESSTTSFKS .U.K. AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCSCWTTR . SC . . 
T . I PI S I SLNSCLIGNC . . GNSKW . GNHS I WYYRSHQGW . . KQAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKTINEWT . STLCR . TI . VPG . ITT . SF . HLD . G . SGNRSRRESKYCSS 
HVL . PISSRA . HSSC.T . FDF . YCS YPHRCQR . LSKTKI I SR . FIT . IC . . NRSL . F . KW 
CRNH . RYDSTTRI WR . YY . VI CC . LIKLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK. IRNVNKR .SIDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST . S . RST . S . RST . S . RST . S . RST . S . RST . SRRST . S . RGT .NPRSTYS . RRL 
QS . . RSSN . YYS . SV . FTKD . DSFRSSYG . NRQ . TDNFTSDTC 

SEQ ID NO. 5116 

STRAIN M781 frame: 1 

LNNKGVGGDGVQ I YQYY I KMDNNKP YL S P KD KTT VE KLEDRW KKI T F KVQDTG I GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGI DLPAPTSMRS FDYSTPPGTK 
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PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . IC . VSST . LC . . PYFS . QS 
KWQNITSRI . WQISHS . KCFVS .SI . . H. S -NLS . . . SQ . RNYYHNRLFCRYKI CQYHSG 
. LFEQYYF . AISYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . IARNFSYSA . R . IC . DWY 
.ITK.CQTY. .FICSSFE.G.NC. . YLEK . . TRH . SQQNLPI KIQPDKSS SRVYY . .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . I R 
YVFQTISRPSFNY.KNIPYPKWCSK. IEKI . L . FWFN . KSDRWILYL. RCN. P . I . INQW 
CKS . SCL . RARRSI . SSERRYD . KR .TAQSFNSSQ . KYSKSNLC . Y . L.VTL . . GYCEWKR 
SW . R . . VTFN . RMDNICIT .NRKFIKC . KFDYGDG . CK . ESSTTSFKS .U.K. AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCSCWTTR . SC . . 
T.IPISISLNSCLIGNC. . GNSKW . GNHSIWYYRSHQGW . . KQAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKT TNEWT . STLCR . TI . VPG . ITT . SF . HLD . G . SGNRSRRESKYCSS 
HVL . PI SSRA . HSSC . T . FDF . YCS YPHRCQR . LSKTKI I SR . FI T . IC . . NRSL .F.KW 
CRNH . RYDSTTR I WR . YY . VICC . LI KLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK . IRNVNKR . S IDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST . S . RST . S . RST . S . RST . S . RST . S . RST . SRRSTVKLKRDI KPKKHL . LKKA 
TKL I T F I KL I LQLKRL I YQRLRQFPQF I WLEQT I NR . LH I RHML 



SEQ ID NO. 5117 

STRAIN JM9130013 frame: 2 

GVQI YQYYI KMDNNKPYLS PKDKTTVEKLEDRWKKITFKVQDTG IGLKDVYLQSVKYVGG 
GNNNLDL I TPPG FKKEDKKVEKPKIjDRPPG I DLPAPTSMRS FD YSTPPGTKPSKPKDSLS 
TPPGFPDLNTPPDEAPKDSKKDAI EDKSGAI KYAKSLQLS FVDDP I LAS KVNGKI LQVES 
DGKLVI PRNALSANQFDDTSLKI YRNNNRNKEIT I TTDYFADTKYVNITAVDYLSNTTFE 
QIATGETVDYHAIVFSSFAAIKDKGGKIYVNDKLQETSRIALKDKSVKIGIELPNDVRHI 
DSLSVRRLNEVKTVDNILKNDEQDINLSKTYQLKYNPTNRRLEFTIN^ 
DGKMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSEFKGQLIAKTGTVELDMFFKQSQDP 
AS 1 1 KKI YL I QNGVPNELKKFDSSFGLTESQ I DG YY I YKDAI NLKFKLT S GASLKWYKG 
QEDPYSHQKEDMTKXGEQLSHSTQANENTAKVTFANIDWSHYSKVTVNGKEVGKGSELPL 
TKGWTTFVLHKTENSLNVKSLIMETGSVSKK^ 

YETSDSLVLRINLTAOTKIoNFNAVKGASALTENMMMRQFAVAGPQDDPVSEHKYPSVFLL 
TPALLETASEATLNGKE ITASGI I GH I KDGDKSKHVEVKMVNENGDMLGTP VI I QGKDLT 
NRTKPLMSGRRVLYAGKQYEFRAKLPLSRFNTWI RVEWTEAGEKAS I VRRMFFDQSVPE 
IiNTAVAKRDLTSDTAL I HI VAKDDSLKLKLYQDD SLLES VDKTGLYS FRNGVE I TKDMTV 
PLEFGDNI IKLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDMLMKHLVVPE 
MAGAYTLT I D EAPNTNE S GMLTNAKVS I HYVNGG VDKVD VP I KWDLEA I RKAEEAHKAD 
EARKAEEARKAEEAHKAEEVRKAEEAHKVEEAP . S . RGT . NPRSTYS . RRLCG . . RSSN . 
YYS . SV . FTKD . DSFRSSYG . NRQ . TDNFTSDTC 

PRETTY Of: /biotmp/msa235427 . 2 {* } December 10, 2002 05:18 



msa235427.2{l95_H36B} G 

msa235427.2{l95_JM9130013} G 

msa235427.2{l95_18RS2l} LNNKGVGGDG 

msa23 5427 .2{195_2 603} LNNKGVGGDG 

msa235427 .2{l95_A909l LNNKGVGGDG 

msa235427 .2{195_C0H1} LNNKGVGGDG 

msa235427 .2{l95_M732} LNNKGVGGDG 

msa23 5427.2{l95_M78l} LNNKGVGGDG 

Consensus ********** 



VQI YQYYI KM 
VQI YQYYI KM 
VQI YQYYI KM 
VQI YQYYI KM 
VQI YQYYI KM 
VQI YQYYI KM 
VQIYQYYIKM 
VQI YQYYI KM 
********** 



DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
DNNKPYLSPK 
********** 



DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
DKTTVEKLED 
********** 
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RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
RWKKITFKVQ 
********** 
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msa23 5427 . 2 { 195_H36B} DTGIGLKDVY 

Itisa235427 .2{195_JM9130013} DTGIGLKDVY 

msa235427.2{l95_18RS2l} DTGIGLKDVY 

msa235427.2{l95_2603} DTGIGLKDVY 

msa235427.2{l95_A909} DTGIGLKDVY 

' msa235427.2{l95_COHl} DTGIGLKDVY 

msa23 5427 .2{195_M732} DTGIGLKDVY 

msa23 5427.2{l95_M78l} DTGIGLKDVY 

Consensus ********** 



LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
LQSVKYVGGG 
********** 



NNNLDLITPP 
NNNLDDITPP 
NNNLDLITPP 
NNNLDLITPP 
NNNLDLITPP 
NNNLDLITPP 
NNNLDLITPP 
NNNLDLITPP 
********** 



GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
********** 
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KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
********** 
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msa235427.2{l95_H36B> DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEAlKdskK 

msa235427 . 2{195_JM9130013) DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEApKdskK 

msa235427.2{l95_18RS2l} DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEApKdskK 

msa235427.2{l95_2603) DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEApKdskK 

msa235427.2{195_A909} DLPpPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEAlKdskK 

msa235427.2{l95_COHl} DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEAtKg . . K 

msa235427. 2(195 M732) DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEAtKg. . K 

msa235427.2{l95_M78l} DLPaPTSMRS FDYSTPPGTK PSKPKDSLST PPGFPDLNTP PDEAtKg.. K 

Consensus ***-****** ********** ********** ********** ****_* * 



msa235427.2{l95_H36B} 
msa235427.2{l95_JM9130013} 
msa235427 . 2 { 195_18RS21 J 
msa235427 . 2{ 195_2603 } 
msa235427 .2{195_A909} 
msa235427 . 2 { 195_C0H1 } 
msa235427 .2{l95_M732j 
msa235427.2{195_M781} 
Consensus 
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daiedkBgai 
daiedksgai 
daiedksgai 
daiedksgai 
daiedksgai 
rry.r.irsn 
rry.r.irsn 
rry.r.irsn 



kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
.ic.vsst .1 
. ic.vsst .1 
.ic.vsst .1 



vddPilaskv 
vddPilaskv 
vddPilaskv 
vdgPilaskv 
vddPilaskv 
c. .Pyf s .qs 
c. .Pyf s .qs 
c. .Pyf s .qs 



ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
kwqnitsri . 
kwqnitsri - 
kwqnitsri . 



200 

gklvipmal 
gklviprnal 
gklvipmal 
gklviprnal 
gklvipmal 
wqishs .kef 
wqishs . kef 
wqishs. kef 



856 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235427 . 2 { 195_H36B} 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS21 } 
msa235427 .2{l95_2603} 
msa235427 .2{195_A909} 
msa23 5427 . 2 { 195_C0H1 } 
msa23 542'7 . 2 f 19 5_M732 } 
msa235427.2{l95_M78l} 
Consensus 



msa23S427.2{l95_H36B} 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS21 } 
msa235427 . 2 { 195_2603 } 
msa23 5427.2{l95_A909} 
msa235427 .2{l9 5_COHl} 
msa235427.2(l95_M732j 
msa23 5427.2{195_M781) 
Consensus 



msa235427.2{l95_H36B} 
msa235427 . 2{ 195_JM9130013 } 
msa235427 ,2{l95_18RS2l} 
msa23 5427. 2 { 195^2603} 
msa23 5427 . 2 { 195_A909} 
msa235427 . 2 { 195_COHl} 
msa2 3 5427 . 2 { 195_M732 } 
msa235427-2{l95_M78l} 
Consensus 



msa235427 .2{l95_H36Bl 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS21 ] 
msa235427.2{l95_2603 J T 
msa235427.2{l95_A909] 
msa23 5427 . 2 { 195_C0H1 ] 
msa235427 . 2 { 195_M732 ; 
msa235427 .2{l95_M78l} 
Consensus 



msa235427 . 2 { 195_H36B} 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS2l} 
msa235427 . 2 { 195_2603 } 
msa235427.2{l95_A909} 
msa2 3 54 2 7 . 2 { 1 9 5_C0H1 } 
msa235427 . 2 { 195_M732 } 
msa235427.2{l95_M78l} 
Consensus 



msa235427 . 2 { 195_H36B} 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS21 } 
msa235427 .2{195_2603} 
msa235427 . 2 { 195_A909 } 
msa235427 . 2 ( 195_C0H1 J 
msa235427 .2{195_M732} 
msa235427.2{l95_M78l} 
Consensus 



201 

sanqfddtsl 
sanqf ddtsl 
sanqfddtsl 
sanqfddtsl 
sanqfddtsl 
vs . si . .h. s 
vs .si . .h.s 
vs. si. .h.s 



kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
.nls. . .sq. 
.nls. . .sq. 
.nls . . .sq. 



eitittdyFa 
eitittdyFa 
eitittdyFa 
eitittdyFa 
eitittdyFa 
rnyyhnrlFc 
rnyyhnrlFc 
rnyyhnrlFc 



dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
ryKicqyhsg 
ryKicqyhsg 
ryKicqyhsg 



250 

dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
. If eqyyF .a 
.lfeqyyF.a 
. lfeqyyF.a 



251 

latgetvdyh 
latgetvdyh 
latgetvdyh 
latgetvdyh 
latgetvdyh 
isyw.nsrlp 
isyw.nsrlp 
isyw.nsrlp 



aivf ssf aai 
aivf ssf aai 
aivf ssf aai 
aivf ssf aai 
aivf ssf aai 
chcifklccy 
chcif klccy 
chcifklccy 



kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
.rqGw.dlc . 
.rqGw.dlc . 
.rqGw.dlc . 



dklqetsria 
dklqetsria 
dklqetsria 
dklqetsria 
dklqetsria 
r.iarnfsys 
r .iarnfsys 
r . iarnfsys 



300 

lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
a.r.ic. dwy 
a . r . ic . dwy 
a.r.ic .dwy 



301 

elpndvrhid 
elpndvrhid 
elpndvrhid 
elpndvrhid 
elpndvrhid 
.itk.cqty. 
.itk.cqty. 
. itk.cqty. 



slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
. f icssfe.g 
.f icssfe.g 
.f icssf e .g 



ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
.nc. .yLek. 
.nc. .yLek. 
.nc. .yLek. 



eqdinlskty 
eqdinlskty 
eqdinlskty 
eqdinlskty 
eqdinlskty 
. trh . sqqnl 
. trh. sqqnl 
. trh . sqqnl 



350 

qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
piKiqPdkss 
piKiqPdkss 
piKiqPdkss 



351 

lef tinnins 
lef tinnins 
lef tinnins 
lef tinnins 
lef tinnins 
srvyy. .h.l 
srvyy. .h.l 
srvyy. .h.l 



sseimttFkd 
sseimttFkd 
sseimttFkd 
sseimttFkd 
sseimttFkd 
kfrnhdhFqr 
kfrnhdhFqr 
kfrnhdhFqr 



gKmpelveqK 
gKmpelveqK 
gKmpelveqK 
gKmpelveqK 
gKmpelveqK 
wKdarig . tK 
wKdarig. tK 
wKdarig . tK 



dvsldindmd 
dvsldindmd 
dvsldindmd 
dvsldindmd 
dvsldindmd 
rcf fgykryg 
rcf fgykryg 
rcf fgykryg 



400 

mskf ktirlg 
mskfktirlg 
mskfktirlg 
mskfktirlg 
mskfktirlg 
he. v.nystw 
he. v.nystw 
he .v.nystw 



401 

rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
tKgf .i.Gtt 
tKgf .i.Gtt 
tKgf .i.Gtt 



iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
ycKnwns . ir 
ycKnwns . ir 
ycKnwns . ir 



mf FkqsqdPa 
mfFkqsqdPa 
mf FkqsqdPa 
mfFkqsqdPa 
mfFkqsqdPa 
yvFqtisrPs 
yvFqtisrPs 
yvFqtisrPs 



siikKiyliq 
siikKiyliq 
siikKiyliq 
siikKiyliq 
siikKiyliq 
f ny . Knipyp 
f ny . Knipyp 
f ny . Knipyp 



450 

ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
kwcsk . ieKi 
kwcsk.ieKi 
kwcsk. ieKi 



451 

dssFgltesq 
dssFgltesq 
dssFgltesq 
dssFgltesq 
dssFgltesq 
. 1 . Fwf n.ks 
.l.Fwfn.ks 
.1. Fwf n.ks 



idgyyiykda 
idgyyiykda 
idgyyiykda 
idgyyiykda 
idgyyiykda 
drwilyl . rc 
drwilyl . rc 
drwilyl.rc 



inlkf kltsg 
inlkfkltsg 
inlkfkltsg 
inlkfkltsg 
inlkfkltsg 
n . p . i . inqw 
n . p . i . inqw 
n . p . i . inqw 



aslkwykgq 
aslkwykgq 
aslkwykgq 
aslkwykgq 
aslkwykgq 
cks .scl.ra 
cks .scl.ra 
cks .scl.ra 



500 

edpyshqked 
edpyshqked 
edpyshqked 
edpyshqked 
edpyshqked 
rrsi . sserr 
rrsi . sserr 
rrsi . sserr 



msa235427 . 2 { 195_H36B} 
msa235427 . 2 { 195_JM9130013 ) 
msa235427 . 2 { 195_18RS21 } 
msa235427 . 2 { 195_2603 } 
msa235427.2{l95_A909} 
msa2 35427.2(19 5_C0H1 1 
msa235427 . 2 { 195_M732 } 
msa235427 .2{195_M78l} 
Consensus 



msa235427 . 2 { 195_H36B} 
msa235427 . 2 { 195_JM9130013 J 
msa235427.2{l95_18RS21{ 
msa235427.2{l95_2603) 



501 

mtkkgeqlsh 
mtkxgeqlsh 
mtkkgeqlsh 
mtkkgeqlsh 
mtkkgeqlsh 
yd . kr . taqs 
yd . kr . taqs 
yd . kr . taqs 



stqanentaK 
stqanentaK 
stqanentaK 
stqanentaK 
stqanentaK 
fnssq.kysK 
fnssq.kysK 
fnssq.kysK 



vtfanidwsh 
vtf anidwsh 
vtfanidwsh 
vtfanidwsh 
vtfanidwsh 
snlc .y. Ivt 
snlc.y.lvt 
snlc.y.lvt 



yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
1 . . gycewKr 
1 . . gycewKr 
1 . . gycewKr 



550 

vgkgselplt 
vgkgselplt 
wkgselplt 
wkgselplt 
vgkgselplt 
sw.r . .vtfn 
sw.r. .vtfn 
sw.r. .vtfn 



5S1 600 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 



857 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235427 . 2 { 195_A909 } 
msa235427 . 2 { 195_<:0H1 } 
msa23 5427 .2{195_M732} 
msa23 5427.2{l95_M78l) 
Consensus 



kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsf ks . i i.k.ayegy. 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsfks.i i.k.ayegy. 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsfks.i i.k.ayegy. 



msa235427.2{l95_H36B} 
msa235427 .2{195_JM9130013J 
msa235427 . 2 { 195_18RS2 1 } 
msa235427 . 2 { 195_2603 } 
msa235427.2{l95„A909} 
msa2 3 54 2 7 . 2 { 1 9 5_COHl } 
msa235427 . 2 { 195_M732 } 
msa235427 . 2 { 195_M78 1 } 
Consensus 



601 

lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
aTyyakrf sv 
aTyyakrf sv 
aTyyakrf sv 



etsdslvlri 
etsdslvlri 
etsdslvlri 
etsdslvlri 
etsdslvlri 
lrnk.qssps 
lmk.qssps 
lrnk.qssps 



Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
N . shcry . t . 
N . shcry . t . 
N . shcry . t . 



navkgaSalt 
navkgaSalt 
navkgaSalt 
navkgaSalt 
navkgaSalt 
f .cc.rSecs 
f .cc.rSecs 
f .cc.rSecs 



650 

enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
y .kyddetvc 
y .kyddetvc 
y . kyddetvc 



msa235427 
msa235427 .2{19 
msa235427.2 
msa235427 
msa235427 
msa235427 
msa235427 
msa235427 



.2{195_H36B] 
5_JM9130013 
{l95_18RS2l] 
.2{195_2603 
.2{195_A909; 
.2{l95_COHl] 
.2{195_M732} 
.2{195_M781} 
Consensus 



651 

agpqddpvse 
agpqddpvse 
agpqddpvse 
agpqddpvse 
agpqddpvse 
scwttr . sc . 
scwttr.sc. 
scwttr .sc . 



hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
.t .ipisisl 
. t .ipisisl 
.t .ipisisl 



palLetasea 
palLetasea 
palLetasea 
palLetasea 
palLetasea 
nscLignc . . 
nscLignc. . 
nscLignc. . 



tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
gnskw.gnhS 
gnskw.gnhS 
gnskw.gnhS 



700 

giighikdGd 
giighikdGd 
giighikdGd 
giighikdGd 
giighikdGd 
iwyyrshqGw 
iwyyrshqGw 
iwyyrshqGw 



msa235427.2{l95_H36B} 
msa235427.2{l95_JM9130013} 
msa235427 . 2 { 195_18RS21 } 
msa235427 . 2 { 195_2603 } 
msa235427 . 2 { 195_A909 } 
msa235427 . 2 { 195_C0H1 } 
msa235427 . 2 { 195_M732 } 
msa235427.2{195_M78l} 
Consensus 



701 

ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
. . Kqac . sqn 
. .Kqac. sqn 
. . Kqac . sqn 



nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
gE . kwrharn 
gE . kwrharn 
gE . kwrharn 



viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
pcyysr .rid 
pcyysr .rid 
pcyysr .rid 



rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
. snktinewt 
. snktinewt 
. snktinewt 



750 

vlyagkqyef 
vlyagkqyef 
vlyagkqyef 
vlyagkqyef 
vlyagkqyef 
.stlcr.ti. 
.stlcr.ti. 
.stlcr.ti. 



msa235427.2{l95_H36B} 
msa235427.2{l95_JM9130013} 
msa235427.2{l95_18RS21> 
msa235427 . 2 { 195_2603 } 
msa235427 . 2 { 195_A909 } 
msa235427 . 2 { 195_COHl } 
msa235427 . 2 { 195_M732 } 
msa235427.2{l95_M78l} 
Consensus 



751 

raklplsrfn 
raklplsrfn 
raklplsrfn 
raklplsrfn 
raklplsrfn 
vpg.itt .sf 
vpg.itt .sf 
vpg.itt .sf 



twirvewte 
twirvewte 
twirvewte 
twirvewte 
twirvewte 
.hld.g.sgn 
.hld.g.sgn 
.hld.g.sgn 



agekaSivrr 
agekaSivrr 
agekaSivrr 
agekaSivrr 
agekaSivrr 
rsrreSkycs 
rsrreSkycs 
rsrreSkycs 



mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
shvl .pissr 
shvl .pissr 
shvl .pissr 



800 

ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
a.hssc. t .f 
a.hssc. t -f 
a.hssc. t .f 



msa235427 . 2 { 195_H36B } 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS2 1 } 
msa235427 .2{l95_2603} 
msa235427 . 2 { 195_A909 } 
msa235427.2{l95_COHl} 
msa235427 . 2 { 195_M732 } 
msa235427 . 2 { 195_M781 } 
Consensus 



801 

sdtalihiva 
sdtalihiva 
sdtalihiva 
sdtalihiva 
sdtalihiva 
df .ycsyphr 
df .ycsyphr 
df .ycsyphr 



kddsLklkly 
kddsLklkly 
kddsLklkly 
kddsLklkly 
kddsLklkly 
cqr .Lsktki 
cqr . Lsktki 
cqr .Lsktki 



qddsllesvd 
qddsllesvd 
qddsllesvd 
qddsllesvd 
qddsllesvd 
isr.fit.ic 
isr.fit .ic 
isr.fit.ic 



ktglysfrng 
ktglysfrng 
ktglysfrng 
ktglysfrng 
ktglysfrng 
. .nrsl .f .k 
. . nrsl . f . k 
. .nrsl . f .k 



850 

veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
wcrnh ♦ ryds 
wcrnh . ryds 
wcrnh . ryds 



msa235427 . 2 { 195_H36B } 
msa235427.2{l95_JM9130013} 
msa235427 .2 {l95_18RS2l} 
msa235427 . 2 ( 195__2603 } 
msa235427.2{l95_A909} 
msa235427 .2{195_C0H1} 
msa235427 . 2 { 195_M732 } 
msa23 5427 . 2 { 195J478 1 } 
Consensus 



851 

lefgdnitkl 
lefgdniikl 
lefgdniikl 
lefgdniikl 
lefgdniikl 
ttriwr .yy. 
ttriwr .yy. 
ttriwr .yy. 



savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
vicc.likls 
vicc.likls 
vice. likls 



netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
s. .dpsYl.k 
s. .dpsYl.k 
s. .dpsYl.k 



f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
pf . c . Skpnd 
pf .c.Skpnd 
pf . c . Skpnd 



900 

dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
s . qrs . sncg 
s . qrs . sncg 
s . qrs . sncg 



msa235427 . 2 { 195_H36B) 
msa2 35427 .2{195_JM9130 013} 
nisa235427 . 2 { 195_18RS21 } 
msa235427 .2{l95_2603} 
msa235427.2fl95_A909l 
msa2 35427.2(19 5_COHl } 
msa235427.2{195_M732} 
msa235427 . 2 { 195_M781 } 
Consensus 



901 

lmkhlwpem 
lmkhlwpem 
lmkhlwpem 
lmkhlwpem 
' lmkhlwpem 
yvdealscsr 
yvdealscsr 
yvdealscsr 



aGaytltide 
aGaytltide 
aGaytltide 
aGaytltide 
aGaytltide 
nGrslyinnr 
nGrslyinnr 
nGrslyinnr 



apntnesgml 
apntnesgml 
apntnesgml 
apntnesgml 
dpntnesgml 
rsskhk. irn 
rsskhk. irn 
rsskhk. irn 



tNakvSIhyv 
tNakvSIhyv 
tNakvSIhyv 
tNakvSIhyv 
tNakvSIhyv 
vNkr .SIdsl 
vNkr.SIdsl 
vNkr. SIdsl 



950 

nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
ckwwc . . s . c 
ckwwc . . s . c 
ckwwc . . s . c 



858 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235427.2{l95_H36B} 
msa235427 . 2 { 195_JM9130013 } 
msa235427 . 2 { 195_18RS2 1 } 
msa235427.2(l95_2603} 
msa235427 .2{195_A909} 
msa235427.2{l95_COHl} 
msa235427.2{!95_M732} 
msa235427 . 2{ 195_M78l} 
Consensus 



951 

ikwdleair 
ikwdleair 
ikwdlea. . 
ikwdleair 
ikwdleair 
sd.ss.lrsy 
sd.ss .lrsy 
sd.ss.lrsy 



kaeeahkade 
kaeeahkade 
.... irkaee 
kaeearkaee 
kaeeahkade 
s.s.rst.s. 
s.s.rst.s. 
s.s.rst.s. 



arkaeearka 
arkaeearka 
arkaeearka 
arkaeearka 
arkaeearka 
rst.s.rst . 
rst.s.rst. 
rst.s.rst. 



deahkaeevr 
eeahkaeevr 
eeghktqeap 
eeghktqeap 
eearkaeear 
s .rst.s.rs 
B.rst.s.rs 
s. rst.s.rs 



1000 
kaeeahkvee 
kaeeahkvee 
iveegykvnn 
iveegykvnn 
kaeeghktqe 
t.s.rst .sr 
t.s .rst .sr 
t . s .rst . sr 



1001 1050 

msa235427 .2{l95_H36B} arkaeeghkt qeapiveegy kvnnvhqtdt tvkasdlpkt ktvsavhmar 

msa235427 . 2{l95_JM9130013} ap.s.rgt.n prstys.rrl qg. .rssn.y ys . sv. f tkd .dsfrssyg. 

msa235427.2{l95_18RS2l} vhqtdttvka sdlpktktvs avhmartdnk qitshqthve 

msa235427 .2(195_2603} vhqtdttvka sdlpktktvs avhmartdnk qitshqthve kqikntlpst 

msa235427.2{195_A909} apiveegykv nnvhqtdttv kasdlpktkt vsavhmartd nkqitshqth 

msa235427.2{l95_COHl} rst.s.rgt. nprstys.rr lqs..rssn. yys.sv.ftk d. dsfrssyg 

msa235427.2{l95_M732} rst.s.rgt. nprstys . rr Iqs . . rssn . yys . sv . ftk d. dsfrssyg 

msa235427.2{l95_M78l} rstvklkrdi kpkkhl . Ikk atklitfikl ilqlkrliyq rlrqfpqfiw 

Consensus 

1051 1081 

msa235427.2{l95_H36B} tdnkqitshq th - 

msa235427.2{l95_JM9130013} nrq.tdnfts dtc - 

msa235427 . 2 { 195_18RS2l) ~ 

msa235427 .2{l95_2603} gdskrgyyit gmaivmlsvl fslakkfksk y 

msa235427.2{l95_A909j vekqikn - 

msa235427 . 2 { 195_COHl } .nrq.tdnft sdTC 

msa235427 . 2{195_M732 } .nrq.tdnft sdTC.k - 

mea235427 .2{195_M781} leqtinr.lh irhml 

Consensus ******** ********** * 
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SEQ ID NO. 5201 
STRAIN 090 

AGCX3ATACCTTTAATTTTGATATTGACCAAATTGCAGA 

CAATG CT AT CACTAAAACAGAT AAAAC AACAG AAATTATTTC CAACCAGA 

C^CAAGCCAAACTGGGCAAATTGCCTTTTTTGAAAAACTAACACCAGCA 

CAAAAGTCTGCTATCTCTGAAAAAACACGAGCTTTGGTAGATACTTTTGT 

CGGCGATCAAAATGCGCTCCTTGATTTTGGACAATCCGCAGTAGAAGGCG 

TT AAT AC CACTGTT AAT CATAT CTTGTCTG AG CAG AAAAAAATTCAAATT 

CCTCAAGTTGATGATTTACTAAAAAATGCTAATCGCGAACTAAATGGATT 

TATTG CCAAATATAAAGATG CT ACTCCGG CAG AATTAgAGAAAAAAC CAA 

ACTTGATTCAAAAATTATTCAAACAAAGCAAGACCTCGCTAC^ 

TATTTTGACTCACAAAACATCGAG CAAAAAATGGATATGATGGCaGCGAA 

TGTTGTCAAACAAGAAGATACTTTGGCAAGAAATATCG t CTCTGCTGAAA 

TG CT CATTGAAGATAATACTAAAT CTATT GAAAAT TT GGTTGG AGTT ATT 

GCTt t TATTGAATCgAGTCAAG CCGAGGCTGCTAATCG t GCAaGCCACTT 

ACAACAAGAAATT CTAG CATTAGAT AG CC aAACGT c CGAGTATCAAAT t A 

AAAGTaACCAATTAGCTCGAATGACTGAAGTTATCAATACCCTCGAACAG 

CMCATACTGAATATGTCAGCCGTCTOTACGTTGCATGGGCAACAACACC 

ACAGATGCGAAACTT GGTCAAAGT ATCGT CAGATATGCGT CAGAAACTTG 

GCATGTTACGTCGAAATACCATTCCAACAATGAAACrCTCAATCGCrCAG 

TTAGGCATGATGCAACAATCrrGTCAAATCCGGTGTCACTGCTGATGCTAT 

TGTCAACGCTAATAATG CAGCATTG CAGATG CTGG CTGAAACTAGTAAAG 

AAGCGATTCCGATGTTAGAGAAGACCGCACAAAGCCCCACTGTTT CTATT 

AAAT CTGTGACTGCATTAGCTGAAAG CTTAGTGG CTCAAAATAATGGTAT 

TATCGCTGCCATAGACAAAGGACGTAAGGAACGTGCCCaATTGGAATCTG 

CTGTTATTAAAT CGG CTGAAACAAT CAATGATTCTGT CAAAATT CGTGAT 

AAAAAAATAGTTGAAGC CTTACT CAACGAAGGT aAATCTAC C CAAGAAAA 

AGTTGATGAGTCT 

SEQ ID NO. 5202 
STRAIN A90 9 

AGCGATACCTTT AATTTTGATATTGAC CAAATTG CAGA 

CAATGCTAT CACTAAAACAGATAAAACAACAGAAATTATTTCCAAC CAGA 

CAACAAGCCAAACTGGGCAAATTGCCTTTTTTGAAAAACT 

CAAAAGTCTGCTATCTCTGAAAAAACACCAGCTT 

CGGTGACCAAAATGCGCTCCTTGATTTTGGACAATCCGCAGTAGAAGGCG 
TTAATAC C^CTGTTAATCATAT CTTGTCTG AG CAGAAAAAAATTCAAATT 
C CTCAAGTTG ATGATTT ACTAAAAAATGCTAAT CG CX3AACTAAATGG ATT 
TATTG C CAAATAT AAAGATG CTACT C CGG CAG AATTAGAGAAAAAAC CAA 
ACTTGATTCAAAAATTATTCAAACAAAGCAAGACCT CG CTACAGG AATTT 
TATTTTGACTCkCAAAACATCGAGC^^ 

TGTTGTCAAACAAGAAGATACT^GGCAAGAAATATCGTCTCTGCTGAAA 
TGCT CATTGAAGATAATACTAAAT CTATTGAAAATTTGGTTGGAGTTAwT 
GCTTTTATTGAATCGAGTC^GCCGAGGCTGCC^TCGTGCAAGCCACTT 
ACAACAAGAAATTCTAG(^TTAGATAGCCAAACGTCCGAGTATCAAATTA 
AAAGTAACCAATTAGCT CGAATGACTG AAGTTAT CAATACCCT CGAACAG 
CAACATACTGAATATGTCAG C CGT CTCTACGTTG CATGGGCAACAACAC C 
ACAGATG CGAAACTTGGTCAAAGTATCGT CAGAT ATG CGTCAAAAACTTG 
GCATGTTACGTCGAAATACCATT C CAACaATGAAACT CT CAATCG CT CAG 
TTAGGCATGATGCAACAATCTGTCAAATCCGGTGTCACTGCTGATGCTAT 
TGTCAACGCTAATAATG CAG C^TTG CAGATG CTGG CTGAAACTAGTAAAG 
AAGCGATTCCGATGTTAGAGAAGAC CG CACAAAG C CC CACTGTTT CTATT 
AAATCTGTCACrGCATTAGCTG AAAGCTTAGTGG CT CAAAATAATGGTAT 
TAT CGCTGC CAT AGACAAAGGACGTAAAG AACGTG CC CAATTAGAAT CTG 
CTGTTATTAAAT CGG CTGAAACAATCAATGATTCTGT CAAAATT CGTGAT 
AAAAAAATAGTTGAAGC CTTACTCAACGAAGGTaAATCTACCCAAGAAAA 
AGtTGATGAGTCT 

SEQ ID NO. 5203 
STRAIN H3 6B 

AG CG aTAC CTTTAATTTTGATATTG AC CAAATTG CAGAC 
AATGCTATCACTAAAACAGATAAAACAACAGAAATTAT^ 
AACAAGCCAAACTGGG CAAATTG CCTTTTTTG AAA 

AAAAGTCTG CTAT CT CTGAAAAAACAC CAG CITTGGTAGATACTTTTGTC 
GGTGACCAAAATGCGCTCCTTGATTTTGGACAATCCGCAGTAGAAGGC^ 
TAATACCACTGTTAATCATATCTTGTCTGAGCAGAAAAAAATTCAAATTC 
CT CAAGTTGATGATTTACTAAAAAATG CT AAT CG CGAACTAAATGGATTT 
ATTG C CAAATATAAAGATG CTACTCCGG CAGAATTAGAGAAAAAACCAAA 
CTTGATT CAAAAATTATTCAAACAAAG CAAGACCTCG CTACAGGAATTTT 
ATTTTG^CTC^CAAAACATCGAGCAAAAAATGGATATGATGGCAGCGAAT 
GTTGTCAAACAAGAAGATACTTTGG CAAG AAATATCGT cTCTGCTGAAAT 
G CT CATTGAAGATAATACTAAATCTATTGAAAATTTGGTTGGAGTTA^ 
CTt tTATTGAATCGAGTCAAGCCGAgGCTGCCAATCGTGCAAGCCACTTA 
CAACAAGAAATTCTAGCATTAGATAG C CAAACGT C CG AGT AT CAAATTAA 
AAGTAACCAATTAGCTCGAATCACTGAAGTTATCAATACCCTCGAACAGC 
AACATACTGAATATGTCAGCCGTCTCTACGTTGCATGGGCAACAACACCA 
CAGATGCGAAACITGGTCAAAGTATCGTCAGATATGCGTCAAAAACTTGG 
CATGTTACGT CGAAATAC CATTC CAACaATGAAACT CT CAAT CGCTCAGT 
TAGG CATGATGCAACAAT CTGT CAAAT CCGGTGT CACTG CTGATG CTATT 
GTCAAGG CTAATAATGCAG CATTG CAG ATG CTGG CTG AAACTAGTAAAGA 
AGCGATTCCGATGTTAGAGAAGACCGCTVCAAAGCCCCACTGTTTCTA'ITA 
AAT CTGTCACTGCATTATCTGAAAG CTTAGTGG CT CAAAATAATGGTATT 

atcgctgccatagacaaaggacgtaaagaacgtgcccaattagaatctgc 
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TGTTATTAAATCGG CTG AAACAAT CAATG ATT CTGT CAAAATT CGTG AT a 
AAAAAATAGTTG AAG CCTT ACT CAa CGAAGGT aAAT CT AC C CAAGAAAAA 
GTTGATG AGT CT 

SEQ ID NO. 5204 
STRAIN 18RS21 

TTTTGATATTGAC CAAATTG CAGA CAATG CTAT CACT AAAACAG AT AAAA 
CAACAGA^TTATTTCCAACCAGACAACAAGCCAAACTGG 
TTTTTTGAAAAACTAACACCAGCACAAAAGTCTGCTATCrCT 
ACCAGCTTTGGTAGATACTTTTGTCCK3CGATCAAAAT^ 
TTGGACAATCCX3CAGTAGAAGGCGTT7VATACCACTGTTAATCATATCTTG 
TCTGAG CAG AAAAAAATT CAAATT C CT CAAGTTG ATG ATTTACTAAAAAA 
TGCTAATCGCGAAOTAAATGGATTTATTGCCAAATATAAAGATGCTACTC 
CGGCAGAATTAGAGAAAAAACCAAACTTGATTCAAAA^TTATTC^AACAA 
AGCAAGACCTCGCTACAGGAATTTTATTTTGACTCACAAAACATCa^ 
AAAAATG GATATGATGG CAG CG AATGTTGTCAAACAAGAAGATACTTTGG 
CAAGAAATAT CGT CT CTGCTGAAATG CTCATTGAAGATAATACTAAATCT 
ATTGAAAATTTGGTTGG AGTTATTG CTTTTATTGAAT CGAGT CAAG CCGA 
GG CTG CTAAT CGTG CAAG C CACTT ACAACAAG AAATT CTAGCATT AGATA 
GCCAAACGTCCGAGTATCAAATTAAAAGTAACCAATTAGCTCGAATGACT 
GAAGTTATG^TACCCTCGAACAGCAACATCCTGAATATGTCAGCCGTCT 
CTACGTTGGATGGGCAACAACACCACAGATGCGAAACT^ 
CGTCAGATATG C GT CAGAAACTTGG CATGTTACGT CGAAATACCATT CCA 
ACAATGAAACT CTCAATCG CT CAGTTAGG CAT GATGCAACAAT CTGT CAA 
ATCCGGTGTGA.CTG CTGATG CTATTGT CAACG CTAATAATG CAG CATTG C 
AGATGCTGG CTG AAACTAGT AAAGAAG CGATTCCGATGTTAGAGAAGACC 
GCACAAAGCCCCACTGTTTCTATTAAATCTGTCACTGCATC^ 
CTTAGTGGCTCAAAATAATGGTATTATCGCTGCGATAGACAAAGGACGTA 
AGGAACGTG CC C aATTGGAAT CTIGCTGTTATTAAAT CGG CTGAAACAATC 
AATGATTCTGT CAAAATT CGTGATAAAAAAATAGTTGAAG C CTTACT CAA 
CGAAGGTaAATCTACCCAAGAAAAAGTTGATGAGTCT 

SEQ ID NO. 5205 
STRAIN K732 

AG CG ATACCTTTAATTTTGATATTG AC CAAATTG CAGAC 
AATGCTATCACTAAAACAGATAAAACAACAG 

AACAAGC CAAACTGGG CAAATTG CCTTTTTTGAAAAACTAACAC CAG CAC 
AAAAGTCTG CTAT CT CTGAAAAAACACCAGCTTTGGTAGATACITTTGTC 
GGTGACCAAAATG CG CT CCTTG ATTTTGG ACAATCCG CAGTAGAAGG CGT 
TAATACTACTGTTAAT CAT AT CTTGT CTGAGCAG AAAAAAATT CAAATTC 
CT CAAGTTGATGATTTACTAAAAAATG CTAAT CG CG^CTA 
ATTGC CAAATATAAAGATGCTACTC CGG CAGA^TTAGAGAAAAAACCAAA 
CTTGATTCAAAAATTATTCAAACAAAG CAAGACCT CGCTACAGGAATTTT 
ATTTTGACTCACAAAACATCGAGCAAAAAAT^ 

GTTGTCAAACAAGAAGATACTTTGGCAAGAAATAT CGT CT CTGCTGAAAT 
GCTCATTGAAGATAATACTAAATCTATTGAAAATTTGGTTC 
CTTTTATTG AATCGAGTCAAG C CGAGG CTG CCAAT CGTG CAAGC CACTTA 
CAACAAGAAATTCTAG CATT AGATAGC CAAAC GT C CG AATAT CAAATTAA 
AAGTAACC^ATTAGC CCGAATGACTGAAGTTAT CAATAC CCT CGAACAG C 
AACATACGGAATATGTCAGC CGTCTCT ACGTTG CATGGG CAACAACACCA 
CAGATGCGAAACTTGGTCAAAGTATCGTCAGATATGCGTCAGAAACTTGG 
TATGTTACGTCGAAATACCATTCCAACAATGAAACTCTCAAT CG CTCAGT 
TAGG CATGATGCAACAAT CTGTCAAATCCGGTGT CACTG CTGATGCT ATT 
GT CAACG CTAAT AATG CAG CATTG CAAATG CTGG CTG AAACTAGT AAAGA 
AGCGATTCCGATGTTAG AGAAGACCGCACAAAG C C CCACTGTTTCTATTA 
AATCTGT CACTG CATTAGCTGAAAG CTTAGTGG CT CAAAATAAT GGTATT 
ATCGCTGCCATAC4AG^AAGGACGTAAGGAACGTGCCCAATTAGAATCTGC 
TGTTATTAAATCGG CTGAAACAAT CAATGATTCTGTCAAAATTCGTG ATA 
AAA^VAAT AGTTGAAG CCTTACT CAACG AAGGTAAATCTACCCAAGAAAAA 
G 

SEQ ID NO. 5206 
STRAIN COH1 

CTAAAACAGATAAAACAACAGAAATTATTT C CAA.C CAGACAACAAGCCAA 
ACTGGGCAAATTGCCTTTTTTGAAAAACTAACAC CAG C^CAAAA3TCTG C 
TwTCTCTGAAAAAACACCAGCTTTGGTAGATACTTTTG^ 
ATG CG CT CCTTGATTTTGGACAAT CCG CAGTAGAAGG CGTTAAT ACTACT 
GTTAAT CATATCTTGTCTGAGCAGAAAAAAATTCAAATTC CT CAAGTTGA 
TGATTTACTAAAAAATG CTAAT CG CGAACTAAATGGATTTATTG CCAAAT 
ATAAAGATGCTACTCCC4GCaGAATTAGAGAAAZUy^CCAAACrT 
AAATTATTCAAACAAAG CAAC^A.CCTCX3CTACAGG AATTTTATTTTGACT C 
ACAAAACAT CGAGCAAAAAATGGATATGATGGCAG CAAATGTTGTCAAAC 
AAGAAGATACTTTGG CAAGAAATAT CGTCTCTGCT GA^ 
GATAATACTAAAT CTATTGAAAATTTGGTTGG AGTTATTG ClTTTATTGA 
AT CGAGTCAAGC CGAgG CTG CCAATCGTGCaAG C CACTT ACAACAaGAAA 
TT CT AGC aTT AGATAGCCAAACGT C CG AATATCAAATTAAAA3T AACCAA 
TTAG C C CGAATGACT GAaGTTATCAaT aC CCT CGAACAG CAACATACGGA 
aTATGTCAG CCGT CT CT ACGTTGCATGGG CAACAACAC CACAGATG CGAA 
ACTTGGT CAAAGTAT CGT CAGATATGCGTCAGAAACTTGGTATGTTACGT 
CGAAATACCATTCCAACAATGAAACTCTCAATCG CT CAGTTAGG CATGAT 
GCAACAATCTGTCAAATCCGGTGTCACTGCTGATGCTATT^ 
ATAATGCAG CATTG CAAATG CTGG CTG AAACTAGT AAAGAAG CG ATT CCG 
ATGTTAGAGAAGAC CGCACAAAGC C CCACTGTTT CTATTAAATCTGT CAC 
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TGCATTAGCTGAAAGCTTAGTGGCTCAAAATAATGGTATTATCGCTGCCA 
TAGACAAAGGACGTAAGGAACGTGCCCAATTAGAATCTGCTGTTATTAAA 
TCGGCTGAAACAATCAATGATTCTGTCAAAATTCGTGATAAAAAAATAGT 
TGAAGCCTTACTCAaCGAAGGTAAATCTACCCAAGAAAAAGTTGATGAGT 
CT 

SEQ ID NO. 5207 
STRAIN M781 

TTTTGATATTGACCAAATTGCAGACAATGCTATCACTAAAACAGATAAAA 
CAACAGAAATTATTT C CAAC CAGACAACAAG C CAAACTGGG CAAATT GC C 
TTTTTTGAAAAACTAACACCAGCACAAAAGTCTGCTATC 
ACCAGCTTTGGTAGATACTTTTGTCGGTGACCAAAATGCGCTCCTTGATT 
TTGG ACAAT CCG CAGTAGAAGG CGTTAATACT ACTG t TAAT CATAT CTTG 
TCTGAGCAGLAAAAAAATTCAAATTCCTCAAGTTGATGATTTACTAAAAAA 
TG CT AAT CG CGAACTAAATG GATTT ATTG C CAAAT ATAAAGATG CTA CT C 
CGGCAGAATTAGAGAAAAAACCAAACTTGATTCAAAAATTATTCAAACAA 
AG CAAGACCTCG CTACAGGAATTTTATTTTGACT CACAAAACAT CGAG CA 
AAAAATGGATATGATGG CAG CAAATGTTGT CAAA.CAAGAAGATACTTTGG 
CAAGAAATATCGTCTCTGCTGAAATGCTCATTGAAGATAATACTAAATCT 
ATTGAAAATTTGGTTGG AGTTATTG CTTTTATTG AATCG AGT CAAGCCGA 
GG CTG CCAATCGTCCAAGC CACTT ACAACAAGAAATT CT AG CATTAGATA 
GCCAAACGTCCGAATATCAAATTAAAAGTAACCAATTAGCCCGAATGACT 
GAAGTTATCAATACCCTCGAACAGCAACATACGGAATATGTCAGCCGTCT 
CTACGTTGCATGGGCAACIAACACCACAGATGCGAAACTTGGTCAAAGTAT 
CGTCAGATATGCGT CAGAAACTTCGTATGTTACGT CGAAATAC CATTC CA 
ACAATGAAACT CTC AATCG CTCAGTTAGG CATGATGC AACAATCTGT CAA 
ATCCGGTGT CACTG CTGATGCTATTGT CAACGCTAATAATGCAGCATTG C 
AAATG CTGGCTGAAACTAGTAAAGAAG CGATT CCGATGTT AGAGAAGAC C 
G CACAAAGC C CCACTGTTT CTATTAAATCTGT CACTG CATTAGCTGAA^G 
CTTAGTGGCTCAAAATAATGGTATTATCGCTGCCATAGACAAAGGACGTA 
AGGAACGTG CCCAATTAGAATCTGCTGTTATTAAATCGG CTC 
AATCATTCTGTCAAAATTCGTGATAAAAAAATAGTTCAAGCCTTACTCM. 
CGAAGGTAAATCTACCCAAGAAAAAGTTGATGAGTCT 

SEQ ID NO. 5208 
STRAIN CJBI10 

TTTTGATATTGACCAAATTGCAGACAATCCTATCACT 

CAACAGAAATTATTT CCAAC CAGACAACAAG CCAAACTGGG CAAATTG C C 

TTTTTTGAAAAACTAACACCAGCACAAAA.GT CTG CTAT CTCTGAAAA^AC 

ACCAGCITTGGTAGATACTTTTGT CGG CGAT CAAAATG CG CT CCTTGATT 

TTGGACAAT C CG CAGTAGAAGG CGTTAATACCACTGTTAAT CAT ATCTTG 

TCTGAGCAGAAAAAAATTCAAATTCCTCAAGTTGATGATT^ 

TG CTAATCG CGAACTAAATGGATTTATTG C CAAATATAAAGATGCTACT C 

CGGCAGAATTAGAGAAAAAACCAAACTTGATTCAAAAATO 

AGCAAGACCTTCCTACAGGAATTTTATTTTGACT 

AAAAATGGATATGATGGCAG CGAATGTTGTCAAACAAGAAGATACTTTGG 
CAAGAAATAT CGTCTCTG CTGAAATGCTCATTGAAGATAATACTAAATCT 
ATTGAAAATTTGGTTGGAGTTATTGCTTTTATTGAAT CGAGT CAAGCCGA 
GG CTGCTAAT CGTG CAAGC CACTTACAACAAGAAATT CTAG CATTAGATA 
GCCAAACGT CCGAGTAT CAAATTAAAAGTAAC CAATTAGCTCGAATGACT 
GAAGTTATCAATAC CCTCGAACAG CAa CATACTG AAT ATGTCAG CCGTCT 
CTACGTTGCATGGGCaACaACACCACACATGCGAAACTTGGTCAAAGTAT 
CGTCAGATATGCGTCAGAAACTTGGCATGTTACGTCGAAATACCATTCCA 
ACAATGAA^CTCTCAAT CG CTCAGTTAGG CATGATGCAACAAT CTGT CAA 
ATCCGGTGT CACTG CTGATG CTATTGTCAACGCTAATA^TGCAGCATTC C 
AGATG CTGGCTg AAACTAGTAAAGAAG CGATT CCGATGTT AGAGAAGACC 
GCACAAAGCC CCACTGTTT CTATTAAATCTGT CACTG CATTAGCTGAAAG 
CTTAGTGG CT CAAAATAATGGT ATT AT CG CTG CCATAGACAAAGGACGTA 
AGGAaCGTG CC CAATTGGAAT CTG CTGTT ATTAAATC GG CTC AAACAAT C 
AATGATTCTGTCAAAATTCGTGATaAAAAAATAGTTGAAGCCTTACTCAA 
CGAAGGTAAATCTAC C CAAG AAAAAGTTGATG AGT CT 

SEQ ID NO. 5209 
STRAIN 116 9NT 

GCAGACAATGCTATC^CTAAAACAGATAAAACAACAGAAATTATTTCCAA 
CCAGACAACAAGCGAAACTGGGCAAATTGCCTTTTTTG 
CAGCAC^AAAGTCTGCTATCTCTGAAAAAACACCAGCTT^ 
TTTGTCGGTGAC CAAAATG CGCT C CTTGATTTTGG ACAAT C CG CAGTAG A 
AGGCGTTAATACCACTGTTAAT CATAT CTTGT CTG AG CAG AAAAAAATT C 
AAATTCCTCAAGTTGATGATTTACTAAAAAATGCTAATC 
GGATTTATTG CCAAATATAAAGATGCTACT CCGG CAGAATTAGAG AAAAA 
ACCAAACTTGATCCAAAAATTATTCAAACAAAGCAAGACCT 
AATTTTATTTTGACT CACAAAACAT CGAG CAAAAAATGGATATGATGGCA 
GCAAATGTTGT CAAACAAGAAGATACTTTGG CAAGAAATATCGTCTCTG C 
TCAAATGCTCATTGAAGATAATACTAAATCTATTGAAAATTTGGT^ 
TTATTGCTTTTATTGAATCGAGTCAAGCCGAGGCTGCCAATCGTGCAAGC 
CACITACA^CAAGAAATTCTAGCATTAGATAGCCAAACGTCCGAGTAT^ 
AATTAAAAGTAAC CAATTAGCTCGAATGACTG AAGTT AT CAATAC CCT CG 
Aa CAG CAACATACTG AATATGT CAG C CGT CTCTACGTTC CATGGG CAACA 
aCACCACAGATC CGAAACTTGGTCAAAGT AT CGT CAG ATATG CGT CAAAA 
ACTTGG CATGTTACGT CGAAATAC CATTCCAACAATG AAACTCT CAAT CG 
CTCAGTTAGG CATGATG CAACAATCTCTCAAAT CCGGTGT CACTG CTGAT 
G CTATTCTCAACG CTAATAATG CAG CATTC CAGATG CTGGCTGAAACTAG 
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TAAAGAAGCGATTCCGATGTTAGAGAAGACCGCACAAAGCCCCACTGTTT 
CTATT AAAT CTGT CACTG CATT AG CTG AAAG CTTAGTGGCTCAAAAT AAT 
GGTATTAT CG CTG CCATAG ACAAAGG ACGTAAGG AACGTG C C CAATTAGA 
ATCrGCTGTTATTAAATCGGCTGAAACAATCAATGATTCTGTCAAAATT C 
GTGATAAAAAAATAGTTGAAGCCTTACTCAACGAAGGTaAATCTACCCAA 
GAAAAAGTTGATGAGTCT 

SEQ ID NO. 5210 
STRAIN JM913 0013 

AGCGATACCTTTAATTTTGATATTGACCAAATTGCAGAC 
AA.TG CTAT CACTAAAACAG ATAAAACAACAGAAATTATTT CCAACCAGAC 
AACAAGCCAAACTGGGCAAATTGCCTTTTTTGAAAAACTAACAC CAGCAC 
AAAAGTOTGCTATCTCTGAAAAAACACCAGCTTTGGTAGATACTTTTGTC 
GGTGACCAAAATGCGCTCCTTGATTTTGGACAATCCGC^GTAGAAGGCGT 
TAATACCACTGTTAATCATAT CTTGTCTGAG CAG AAAAAAATTC AAATT C 
CT CAAGT TGATGATTTACTAAAAAATGCTAAT CG CGAACTA^XATGGATTT 
ATTG CCAAAT ATAAAGATG CTACTTC CGGCAGAATT AGAGAAAAAAC CAAA 
CTTGATT CAAAAATTATT CAAA.CAAAGCAAGAC CT CG CTACAGGAATTTT 
ATTTTGACTCACAAAACAT CGAG CAAAAAATG G AT ATGATGG CAG CG AAT 
GTTGT CAAACAAGAAGATACTTTGG CAAGAAATAT CGTCT CTGCTGAAAT 
GCTCATTGAAGATAATACTAAATCTATTGA 

CTTTTATTG AATcGAGTCAAG CCGAGGCTG C CAATCGTG CAAGC CACTTA 
CAACAAGAAATT CT AG CATT AG ATAG C CAAACGT C CG AGTAT CAAAT tAA 
AAGTaACCAATTAGCTCGAATGACTGAAGTTATCAATACCCTCGAACAGC 
AACATACTGAATATGTGAGCCGTCTCTACGTTGCATGG^ 
CAC^TGCGAAACT^GGTCAAAGTATCGTCAGATATGCGTCTVAAAACrTGG 
CATGTTACXSTCGAAATACCATTCCAAa^TGAAACTCTCAATCGCTCAGT 
TAGG CATGATG CAACAATCTGT CAAATCCGGTGT CACTG CTGATG CTATT 
GT CAACG CTAAT AATG CAG CATTG CAGATG GTGG CTGAAACTAGTAAAGA 
AGCGATTCCGATGTTAGAGAAGACCGCACAAAGCCCCACTGTTTCTATTA 
AATCTGT CACTG CATTAG CTGAAAG CTTAGTGGCTCAAAAT AATGGTATT 
AT CG CTG CCATAGACAAAGG aCGTAAGGAACGTGC C CAATTAGAATCTG C 
TGTT ATTAAAT CGG CTGAA^CAAT CAATGATTCTGTCAAAATTCGTG AT A 
AAAAAATAGTTG AAG CCTT ACT CAACGAAGGT aAAT CTACCCAAGAAAAA 
GTTGATGAGT CT 

SEQ ID NO. 5211 
STRAIN 2603 

agcgatacctttaattttgatattgaccaaattgcagacaatgctatcac 
taaaacagataaaacaacagaaattatttccaaccagacaacaagccaaa 
ctgggcaaattgccttttttgaaaaactaacaccagcacaaaagtctgct 
atctctgaaaaaacaccagctttggtagatacttttgtcggcgatcaaaa 
tgcgctccttgattttggacaatccgcagtagaag'gcgttaataccactg 
ttaatcatatcttgtctgagcagaaaaaaattcaaattcctcaagttgat 
gatttactaaaaaatgctaatcgcgaactaaatggatttattgccaaata 
taaagatgctactccggcagaattagagaaaaaaccaaacttgattcaaa 
aattattcaaacaaagcaagacctcgctacaggaattttattttgactca 
caaaacatcgagcaaaaaatggatatgatggcagcgaatgttgtcaaaca 
agaagat ac 1 1 tggcaagaaat at cgt ct c tgctgaaatgct cat tgaag 
ataatactaaatctattgaaaatttggttggagttattgcttttattgaa 
tcgagtcaagccgaggctgctaatcgtgcaagccacttacaacaagaaat 
tctagcattagatagccaaacgtccgagtatcaaattaaaagtaaccaat 
tagctcgaatgactgaagttatcaataccctcgaacagcaacatcctgaa 
tatgtcagccgtctctacgttgcatgggcaacaacaccacagatgcgaaa 
cttggtcaaagtatcgtcagatatgcgtcagaaacttggcatgttacgtc 
gaaataccattccaacaatgaaactctcaatcgctcagttaggcatgatg 
caacaatctgtcaaatccggtgtcactgctgatgctattgtcaacgctaa 
taatgcagcattgcagatgctggctgaaactagtaaagaagcgattccga 
tgttagagaagaccgcacaaagccccactgtttctattaaatctgtcact 
gcattagctgaaagcttagtggctcaaaataatggtattatcgctgccat 
agacaaaggacgtaaggaacgtgcccaattggaatctgctgttattaaat 
cggctgaaacaatcaatgattctgtcaaaattcgtgataaaaaaatagtt 
gaagccttactcaacgaaggtaaatctacccaagaaaaagttgatgagtc 
t 
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1 50 

msal3607 -2{201_COH1} C 

msal3607 .2{201_M781} TTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

TTisal3607 .2{201_090} AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607 .2 {201_CJB110} TTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607 .2 {201_18RS21} TTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607.2{201_2603) AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607.2{201_A909} AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607.2{201_H36B} AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3607.2{201_JM9130013} AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

msal3 607 . 2 {201_1169NT} GCAGACA ATGCTATCAC 

msal3607.2{201_M732} AGCGATACCT TTAATTTTGA TATTGACCAA ATTGCAGACA ATGCTATCAC 

Consensus ********** ********** ********** ********** ********** 



863 
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Table 52: Comparative Sequences relating to SAG 1823 



msal3 607 .2{201_COHl} 
msal3607.2{201_M78l} 
msal3607 .2{201_090} 
msal3 607 . 2 {201_CJB110} 
msal3607 .2 {2 01_18RS2l} 
msal3607.2(201_2603} 
msal3607.2{201_A909~ 
msal3607.2{201 4 _H36B 
msal3607.2{201_JM9130013 
msal3607 . 2{201_1169NT 
msal3607-2{201_M732 
Consensus 



51 

TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
TAAAACAGAT 
********** 



AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
AAAACAACAG 
********** 



AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
AAATTATTTC 
********** 



CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
********** 



100 

ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
********** 



msal3607 .2 
msal3607 .2 
msa*13607 . 
msal3607.2{2 
msal3607.2 {2 
msal3 607 .2 
msal3 607 .2 
msal3607 .2 
msal3607 .2 {201_ 
msal3607.2{2 
msal3 607 .2 



{2 01_COH1} 
{201_M781* 
2{201_090 
01_CJB110 
01_18RS2l} 
{201_2603} 
{201_A909} 
{201_H36B} 
JM9130013} 
01_1169NT} 
{201_M732} 
Consensus 



101 

CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
CTGGGCAAAT 
********** 



TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
TGCCTTTTTT 
********** 



GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
GAAAAACTAA 
********** 



CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
CACCAGCACA 
********** 



150 

AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
********** 



msa!3607 . 
msal3607 . 
msal3607 
msal3607.2 
msal3607.2 
msal3 607 
msal3607 
msal3 607 
msal3607.2{201 
msal3607.2{ 
msa!3 607 



2{201_COH1} 
2{201_M78l} 
2{201_090} 
201JTJB110} 
201_18RS2l) 
2{201_2603) 
2{201_A909} 
2{201_H3 6B} 
_JM9130013} 
201_1169NT} 
2{201_M732} 
Consensus 



151 

wTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 



AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 
AAACACCAGC 



TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 
TTTGGTAGAT 



ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 
ACTTTTGTCG 



200 

GtGAcCAAAA 
GfcGAcCAAAA 
GcGAtCAAAA 
GcGAtCAAAA 
GcGAtCAAAA 
GcGAtCAAAA 
GtGAcCAAAA 
GtGAcCAAAA 
GtGAcCAAAA 
GtGAcCAAAA 
GtGAcCAAAA 



_********* ********** ********** ********** *_**_***** 



msal3607 .2 
msal3607 .2 
msal3607. 
msal3607 .2 {2 
msal3 607.2 {2 
msal3607 .2 
msal3607 .2 
msal3607 .2 
msal3607.2{201_ 
msal3607.2{2 
tnsa!3607 .2 



{201_COH1} 
{201_M78l} 
2{201_090} 
01_CJB110} 
01_18RS2l} 
201_2603} 
201_A909} 
201_H36Bj 
JM9130013} 
01__1169NT} 
{201_M732} 
Consensus 



201 

TGCGCT CCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
********** 



GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
GATTTTGGAC 
********** 



AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
AATCCGCAGT 
********** 



AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
********** 



250 

AATACtACTG 
AATACtACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACCACTG 
AATACtACTG 
*****_**** 



msal3607 .2{201_COH1 
msal3607.2{201_M781 
msal3607.2{201_090 
msal3 607 .2 (2 01_CJB110, 
msal3607 .2 {201_18RS21} 
tnsal3607 .2{201_2603} 
tnsal3607.2{201_A909} 
msal3607 .2{201_H36B} 
msal3607.2{201__JM9130013} 
rasal3607.2{201_1169NT} 
msal3607.2{201_M732} 
Consensus 



251 

T TAAT CAT AT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
********** 



CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
********** 



CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
********** 



TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
********** 



300 

TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
********** 



msal3607.2{201__COHl} 



301 350 
GATTTACTAA AAAATGCTAA TCGCGAACTA AATGGATTTA TTGCCAAATA 



864 
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Table 52: Comparative Sequences relating to SAG 1823 



msal3607 ,2{201_M78l} GATTTACTAA AAAATGCTAA 

msal3607 .2{201_09 0} GATTTACTAA AAAATGCTAA 

msal3607 .2{2 01_CJB110} GATTTACTAA AAAATGCTAA 

msal3607 .2{201_18RS21} GATTTACTAA AAAATGCTAA 

msal3607 .2{201_2603} GATTTACTAA AAAATGCTAA 

msal3607.2{201_A909} GATTTACTAA AAAATGCTAA 

l msal3607 .2{201_H3 6B} GATTTACTAA AAAATGCTAA 

tnsal3607.2(201_JM9130013} GATTTACTAA AAAATGCTAA 

msal3607.2{201_1169NT} GATTTACTAA AAAATGCTAA 

msal3 607 .2{201_M732} GATTTACTAA AAAATGCTAA 

Consensus ********** ********** 



TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
TCGCGAACTA 
********** 



AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
********** 



TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
********** 



5{2 01_COH1} 
2{201_M781} 



msal3 607 .2 
msal3 607.2; 
msal3607.2{201__090| 
msal3607 .2 {201_CJB110} 
msal3607 .2 {201_18RS2l} 
msal3607 . 2 { 2 01J2603 } 
msal3607 .2{201_A909} 
msal3607.2{201_H36B} 
msal3607.2{201_JM9130013} 
rasal3607.2{201_1169NT} 
msal3607.2{20 1_M7 3 2 } 
Consensus 



351 

TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 



ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 



AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 
AATTAGAGAA 



********** ********** ********** 



AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
AAAACCAAAC 
********** 



400 

TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATt CAAA 
TTGATc CAAA 
TTGATt CAAA 
*****_**** 



msal3 607.2{20 1_C0H1 } 
msal3607.2{201_M78l} 
msal3607. 2 {201_090} 
msal3607.2{201_CJB110} 
msal3607.2(201_18RS2l} 
msal3607.2{201_2603) 
msal3607.2(201_A909} 
msal3607.2{201_H36Bi 
msal3607 . 2 {201_JM913 0013 } 
msal3607 .2{201_1169NT} 
msa!3607.2{201_M732} 
Consensus 



msal3 607 . 2 { 2 01_COH1 } 
msal3607.2{201_M78l} 
msal3607.2{201_090} 
msal3607.2{201_CJB110} 
msal3607.2(201_18RS2l} 
msal3607. 2 {201^2603} 
msal3607.2(201_A909} 
* msa!3 607.2{201_H36B} 
msal3607.2{201_JM9130013} 
msal3607 .2 {201_1169NT} 
msal3607.2{201_M732} 
Consensus 



401 

AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
********** 

451 

CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
********** 



ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
********** 



ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCaCTAC 
ACCTCgCTAC 
*****_**** 



AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
AGGAATTTTA 
********** 



AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
********** 



GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
********** 



GCAGCaAATG 
GCAGCaAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCaAATG 
GCAGCaAATG 
*****-**** 



450 

TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
TTTTGACTCA 
********** 

500 

TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
********** 



msal3607 .2{201__COHi; 
msal3 607.2 {201_M78i; 
msal3607 .2 {201_090; 
msal3607 . 2 {201_CJB110 
msal3607 . 2 (201JL8RS21 
msal3607 .2{201_2603 
msal3607.2{201_A909. 
msal3 607.2{2 01_H36B] 
msa!3607 . 2 {201_JM9130013 ] 
msa!3607 . 2 {201_1169NT} 
msal3607.2{201_M732} 
Consensus 



msal3 6 07.2(20 1_C0H1 } 
msal3607.2{201_M78l} 
msal3607.2{201_090} 
msal3607.2(201_CJB110) 
msal3607 . 2 {201_18RS2l} 



501 

AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
********** 

551 

ATAATACTAA 
ATAATACTAA 
ATAATACTAA 
ATAATACTAA 
ATAATACTAA 



TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
********** 



AT CTATTGAA 
AT CTATTGAA 
ATCTATTGAA 
AT CTATTGAA 
ATCTATTGAA 



ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
********** 



AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 



TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
********** 



GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 



550 

CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
********** 

600 

TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 



865 



WO 2004/018646 
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Table 52: Comparative Sequences relating to SAG 1823 



msal3607. 2 {201_2603} ATAATACTAA ATCTATTGAA AATTTGGTTG 

msal3607 .2{2 01_A909} ATAATACTAA ATCTATTGAA AATTTGGTTG 

msal 3 6 0 7 . 2 { 2 0 1_H3 6B } ATAATACTAA ATCTATTGAA AATTTGGTTG 

msal3607.2{201_JM913 0013} ATAATACTAA ATCTATTGAA AATTTGGTTG 

msal3 607 .2{201__1169NT} ATAATACTAA ATCTATTGAA AATTTGGTTG 

msal3607.2{201_M732} ATAATACTAA ATCTATTGAA AATTTGGTTG 

Consensus ********** ********** ********** 

601 

msa!3607 .2{201_COHl} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607 . 2 {2 01_M78l} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607 .2{201_090} TCGAGTCAAG CCGAGGCTGC tAATCGTGCA 

msal3607.2{201_CJB110} TCGAGTCAAG CCGAGGCTGC tAATCGTGCA 

msal3607.2{201_18RS2l} TCGAGTCAAG CCGAGGCTGC tAATCGTGCA 

tnsal3607.2{201_2603} TCGAGTCAAG CCGAGGCTGC tAATCGTGCA 

msal3607 ,2{201_A909} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607.2{20 1_H3 6B } TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607.2{201_JM913 0013} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607.2{201_1169NT} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

msal3607.2{201_M732} TCGAGTCAAG CCGAGGCTGC cAATCGTGCA 

Consensus ********** ********** _********* 



GAGTTAt TGC 
GAGTTAwTGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
****** _*** 



TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
********** 

650 

AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 
AACAAGAAAT 



AGCCACTTAC 
AGC CACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
********** ********** 



msal3607.2{201_COHl} 
msal3607.2{201_M78l} 
msal3607.2{201_090} 
msal3607.2{201_CJB110} 
msal3607 . 2 {201_JL8RS2l} 
msal3 607.2{201_2603} 
msal3607.2{201_A909} 
msal3607.2{201_H3 6B} 
msal3607.2{201_JM9130013} 
msal3607.2{201_1169NT} 
msal3607.2{201_M732} 
Consensus 



msal3607.2{201_COHl} 
msal3607.2{201_M78l} 
msal3607. 2 {201_090} 
msal3607 . 2 {201_CJB110} 
msal3607 . 2 {201_18RS2l} 
rasal3607.2{201_2603} 
msal3607.2{201__A909} 
msal3607.2{201__H36B} 
msal3607.2{201_JM9130013} 
msal3607 . 2 {201_1169NT} 
msal3607.2{201_M732} 
Consensus 



msal3607 

msal3607 
msal3607 
msal3607.2{ 
msal3607.2{ 

msal3607 . 

msal3607 . 

msa!3607 . 
msal3607.2{201 
msal3607.2{ 

msal3607 



2{201_COH1} 
2{201_M781} 
2{201_090j 
201_CJB110} 
201_18RS21} 
2{201_2603} 
2{201_A909} 
2{201_H36B} 
_JM9130013} 
201_1169NT} 
2{201_M732} 
Consensus 



rasal3607.2{20 1_C0H1 
msal3607 . 2 {201_M781 
msal3607.2{201_090 
tnsal3 607 . 2 {201_CJB110 
msa!3607 . 2 {201_18RS21 
msal3607 .2 {201_2603 
rasal3607.2{201_A909 



651 

TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
TCTAGCATTA 
********** 

701 

TAGCc CGAAT 
TAGCc CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCt CGAAT 
TAGCc CGAAT 
****_***** 

751 

TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
TATGTCAGCC 
********** 

801 

CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTPAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 



GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
GATAGCCAAA 
********** 



GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
GACTGAAGTT 
********** 



GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
GTCTCTACGT 
********** 



CGTCCGAaTA 
CGTCCGAaTA 
CGTCCGAgTA 
CGTCCGAg TA 
CGTCCGAgTA 
CGTCCGAgTA 
CGTCCGAgTA 
CGTCCGAgTA 
CGTCCGAgTA 
CGTCCGAgTA 
CGTCCGAaTA 
*******_** 



ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
ATCAATACCC 
********** 



TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
TGCATGGGCA 
********** 



TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
TCAAATTAAA 
********** 



TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
TCGAACAGCA 
********** 



ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
ACAACACCAC 
********** 



GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 



ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 



gAAACTTGGt 
gAAACTTGGt 
gAAACTTGGc 
gAAACTTGGc 
gAAACTTGGc 
gAAACTTGGc 
aAAACTTGGc 



700 

AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
AGTAACCAAT 
********** 

750 

ACATaCgGAA 
ACATaCgGAA 
ACATaCtGAA 
ACATaCtGAA 
ACATcCtGAA 
ACATcCtGAA 
ACATaCtGAA 
ACATaCtGAA 
ACATaCtGAA 
ACATaCtGAA 
ACATaCgGAA 
****-*-*** 

800 

AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
AGATGCGAAA 
********** 

850 

ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 



866 



WO 2004/018646 



PCT/US2003/026827 



Table 52: Comparative Sequences relating to SAG 1823 



msal3607 -2{201_H3 6B} 
msal3607 . 2 {201_JM913 0013 } 
msal3607 . 2 {201_1169NT} 
msal3607.2{201_M732} 
Consensus 



CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 



GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 



ATATGCGT CA 
ATATGCGTCA 
ATATGCGT CA 
ATATGCGTCA 



aAAACTTGGc 
aAAACTTGGc 
aAAACTTGGc 
gAAACTTGGt 



ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 



********** ********** ********** _********- ********** 



r.2j 
r.2{ 



msal3607 
msal3607 
msal3607 
msal3607 
msal3607 
msa!3607 - 
rrtsal3607 . 
msal3607 . 
msal3607.2{201 
msal3607.2{ 
msa!3607 



2{2 01_COHl} 
2{2 01__M781} 
.2{201__090} 
201_CJB110} 
201_JL8RS2l} 
2 {201^2603} 
2{201_A909} 
2{201_H36B} 
L_JM913 0013} 
'201_1169NT} 
2{201_M732} 
Consensus 



851 

GAAATA C CAT 
GAAATACCAT 
GAAATA C CAT 
GAAATACCAT 
GAAATACCAT 
'GAAATACCAT 
GAAATACCAT 
GAAATACCAT 
GAAATACCAT 
GAAATACCAT 
GAAATACCAT 
********** 



TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
TCCAACAATG 
********** 



901 

msal3607.2{201__COHl} CAACAATCTG TCAAATCCGG 

msal3607.2{201_M78l} CAACAATCTG TCAAATCCGG 

rasal3 607 .2 {201^090} CAACAATCTG TCAAATCCGG 

msal3607.2{201_CJB110} CAACAATCTG TCAAATCCGG 

msal3607 .2{201_18RS2l} CAACAATCTG TCAAATCCGG 

msal3607. 2 {201^2603} CAACAATCTG TCAAATCCGG 

msal3607 .2{2 01_A909} CAACAATCTG TCAAATCCGG 

msal3607 .2{2 01_H3 6B} CAACAATCTG TCAAATCCGG 

msal3607.2{201_JM9130013} " CAACAATCTG TCAAATCCGG 

msal3607.2{201_1169NT} CAACAATCTG TCAAATCCGG 

msal3607.2{201__M732} CAACAATCTG TCAAATCCGG 

Consensus ********** ********** 



msal3607.2{201_COHl} 
msal3607.2{201_M78l} 
tnsal3607.2{201_090} 
msal3607 .2 {201_CJB110} 
msal3607 .2 {201__18RS21} 
msal3607.2{201_2603l 
rasal3607.2{201_A909] 
tnsal3607 .2{201__H36B) 
msal3607.2{201_JM9130013] 
msal3607 .2 {201_11S9NT) 
msal3607.2{201_M732} 
Consensus 



951 

TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 



TTGCAaATGC 
TTGCAaATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAaATGC 



********** *****_**** 



AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
AAACTCTCAA 
********** 



TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
********** 



TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
********** 



900 

AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
AGGCATGATG 
********** 

950 

TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
TCAACGCTAA 
********** 

1000 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
GCGATTCCGA 
********** ********** 



TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
TCGCTCAGTT 
********** 



GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
********** 



TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 
TAGTAAAGAA 



msal3607 .2{2 01_COH1} 
msal3607.2{201_M78l} 
rasal3607.2{201_090) 
msal3607.2{201_CJB110} 
msal3607.2{201_18RS2l} 
rasal3607. 2 {201^2603} 
msal3607.2(201_A909} 
msal3607.2{201_H36B} 
msal3607.2{201_JM9130013} 
msal3607 . 2 {201_1169NTj 
msal3607 . 2 {2 01_M732 } 
Consensus 



msal3607 
msal3607 
msal360 
msa!3607 .2 
msal3607-2 
msa!3607 
msal3607 
msal3607 
msal3607.2{20 
msal3607.2 



.2{2 01__COH1} 
.2{201_M78l} 
7.2{201_090} 
{201_CJB110} 
{201__18RS21} 
.2{201_2603) 
.2{201_A909} 
.2{201_H36B} 
1_JM9130013} 
{201_1169NT} 



1001 

TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
********** 

1051 

GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GGATTAtCTG 
GCATTAgCTG 
GCATTAgCTG 



GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
********** 



AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 



AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
AGCCCCACTG 
********** 



GGCTCAAAAT 
GGCT CAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 



TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
TTTCTATTAA 
********** 



AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 



1050 
ATCTGTCACT 
AT CTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
ATCTGTCACT 
********** 

1100 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
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Table 52: Comparative Sequences relating to SAG 1823 



msal3607.2{201_M732} 
Consensus 



GCATTAgCTG AAAGCTTAGT GGCTCAAAAT AATGGTATTA TCGCTGCCAT 
******_*** ********** ********** ********** ********** 



msal3607.2{201_COHl} 
msal3607.2{201_M78l} 
msal3 607.2{201_090} 
msa!3607 . 2 {201_CJB110 } 
msal3 607 . 2 {2 01_18RS21 } 
msal3607.2{201_2603} 
msal3607.2{201_A909} 
msal3607.2{201_H36B} 
msal3607.2{201_JM913 0013} 
msal3607 . 2 {201_1169NT} 
rasal3607.2{201_M732} 
Consensus 



msal3 6 0 7 . 2 { 2 01_COH1 } 
msal3607.2{201_M78l} 
msal3 607. 2 {201^090} 
msal3607 . 2 {201_CJB110 } 
msal3607.2{201_18RS2l} 
msal3607.2{201_2603} 
msa!3607.2{201_A909} 
msal3607.2{201_H36B} 
msal3607 . 2 {201_JM9130013 \ 
msal3607.2{201__1169NT} 
msa!3607.2{201_M732} 
Consensus 



msal3607.2{20 1_C0H1 } 
msal3607.2{201_M78l} 
msal3 607.2{201_090} 
msal3607.2{201__CJB110} 
msal3607 . 2 {201_18RS2l} 
msal3607 . 2 {201_2603 } 
msal3 607.2l201_A909} 
msal3 6 07 . 2 {'2 0 1_H3 6B } 
msal3607 . 2 {201_JM9130013 } 
msal3607.2{201_1169NT}' 
msa!3 607.2{2 01_M732} 
Consensus 



1101 

AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
AGACAAAGGA 
********** 

1151 

CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
CGGCTGAAAC 
********** 

1201 

GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
GAAGCCTTAC 
********** 



CGTAAgGAAC 
CGTAAgGAAC 
CGTAAgGAAC 
CGTAAgGAAC 
CGTAAgGAAC 
CGTAAgGAAC 
CGTAAaGAAC 
CGTAAaGAAC 
CGTAAgGAAC 
CGTAAgGAAC 
CGTAAgGAAC 



GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 
GTGCCCAATT 



aGAATCTGCT 
aGAATCTGCT 
gGAATCTGCT 
gGAATCTGCT 
gGAATCTGCT 
gGAATCTGCT 
aGAATCTGCT 
aGAATCTGCT 
aGAATCTGCT 
aGAATCTGCT 
aGAATCTGCT 



1150 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 
GTTATTAAAT 



*****-**** ********** -********* ********** 



AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
AATCAATGAT 
********** 



TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
TCTGTCAAAA 
********** 



TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
TCAACGAAGG TAAATCTACC 
********** ********** 



TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
TTCGTGATAA 
********** 



CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
CAAGAAAAAG 
********** 



1200 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
AAAAATAGTT 
********** 

1250 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 



msal 3 6 0 7 . 2 { 2 0 1_C0H1 } 
msal3607.2{201_M78l} 
msal3607.2{201_090} 
msal3607 . 2 {201_CJB110 } 
msal3607.2{201_18RS2l} 
msal3607.2{201_2603} 
msal3607.2{201_A909j 
msal3607.2{201_H36B} 
msal3607 . 2 {201_JM9 130013 * 
msal3607.2{201_1169NT] 
msal3607.2{201_M732] 
Consensus 



1251 

t 

t 

t 

t 

t 

t 

t 

t 

t 

t 



SEQ ID NO. 5212 

STRAIN _090 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD 
TFVGDQNALLDFGQS AVEGVNTTVNHI LS EQKKI Q I P Q VDDLL KNANRE LNG F I AKY KD A 
TPAELEKKPNIjI QKLFKQSKTSLQEFYFDSQNI EQKMDMMAANWKQEDTIARNIVSAEM 
L I EDNTKSI ENLVGVI AFI ESSQAEAANRASHLQQE I LALDSQTSEYQI KSNQLARMTEV 
INTLEQ^HTE YVSRL WAWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMM 
QQSVKSGVTADAIVNANNAALQMLAETSKEAI PMUEKTAQS PTVS I KS VTALAESLVAQN 
NG 1 1 AAI DKGRKERAQLESAVI KSAETINDSVKI RDKKI VEALLNEGKSTQEKVDES 

SEQ ID NO. 52013 

STRAIN A909 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD 
TFVGDQNALLDFGQSAVEGVNTTVNHI LSEQKKI QI PQVDDLLKNANRELNGFI AKYKDA 
TPAELEKKPNLI QKLFKQSKTSLQEFYFDSQNI EQKMDMMAANWKQEDTLARNIVSAEM 
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Table 52: Comparative Sequences relating to SAG 1823 



LI EDNTKS I ENLVGVXAF I ESSQAEAANRASHLQQE I LALDS QT SEYQ I KSNQLARMTEV 
'INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRROTIPTMKLSIAQLGMM 
QQS VKSGVT ADA I VN ANNAAL QMIAET S KEA I PMLEKTAQSPTVS I KS VTALAE S LVAQN 
NGI I AAI DKGRKERAQLESAVI KS AET I NDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ XD NO. 5214 

STRAIN H36B frame: 1 

SDTFNFDIDQI ADNAITKTDKTTE 1 1 SNQTTSQTGQI AFFEKLTPAQKSAI SEKTPALVD 
TFVGDQNALLDFGQSAVEGVNTTVNHI LSEQKKI Q I PQVDDLLKNANRELNGF I AKYKDA 
TPAELEKKPNL I QKL FKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARN I VSAEM 
LI EDNTICSI ENLVGVI AFI ESSQAEAANRASHLQQE I LALDSQTSEYQI KSNQLARMTEV 
I NTLEQQHTE YVSRLYVAWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMM 
QQS VKSGVTADAI VNANNAALQMLAETSKEAI PMLEKTAQS PTVS I KS VTALSE SLVAQN 
NGI I AAI DKGRKERAQLESAVIKSAETINDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5215 

STRAIN 18RS21 frame: 2 

FD I DQ I ADNAITKTDKTTE 1 1 SNQTTSQTGQI AFFEKLTPAQKSAI SEKTPALVDTFVGD 
QNALLDFGQSAVEGVNTTVNH I LSEQKKI Q I PQVDDLLKNANRELNGF I AKYKDATPAEL 
EKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDN 
TKS I ENLVGVI AFI ESSQAEAANRASHLQQEI LALDSQTSEYQI KSNQLARMTE VI NTLE 
QQHPE YVSRLYVAWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMMQQSVK 
SGVTADAI VNANNAALQMLAETSKEAI PMLEKTAQSPTVS IKS VTALAE SLVAQNNG 1 1 A 
AI DKGRKERAQLESAVI KSAET INDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5216 

STRAIN M732 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEI I SNQTTSG/TGQI AFFEKLTPAQKSAI SEKTPALVD 
TFVGDQNALLDFGQSAVEGVNTTVNHI LSEQKKI QI PQVDDLLKNANRELNGF I AKYKDA 
T PAE LEKKPNL I QKL FKQS KT S LQE FY FD S QN I E Q KMDMMAANVVKQEDTLARN I VSAEM 
LI EDNTKS I ENLVGVIAFIESSQAEAANRASHLQQE ILALDSQTSEYQI KSNQLARMTEV 
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS IAQLGMM 
QQSVKSGVTADAI VNANNAALQMLAETSKEAI PMLEKTAQSPTVS I KS VTALAES LVAQN 
NGI I AAI DKGRKERAQLESAVI KSAET I NDSVKI RDKKI VEALLNEGKSTQEK 

SEQ ID NO. 5217 

STRAIN COH1 frame: 3 

KTDKTTE 1 1 SNQTTCQTGQ I AFFEKLTPAQKSAX SEKTPALVDTFVGDQNALLD FGQSAV 
EGVNTTVNH I LSEQKKI QI PQVDDLLKNANRELNGF I AKYKDAT PAE LEKKPNL I QKLFK 
QSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNI VSAEML IEDNTKS IENLVGVI A 
F I ES S QAEAANRASHLQQE I LALDSQTSEYQ I KSNQLARMTEVINTLEQQHTEYVSRLYV 
AWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMMQQS VKSGVTADAI VNAN 
NAAL QMLAET S KEA I PMLEKTAQSPTVS I KS VTALAE SLVAQNNG 1 1 AAIDKGRKERAQL 
ESAVI KSAET I NDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5218 

STRAIN COH1 frame: 3 

KTDKTTE 1 1 SNQTTCQTGQ I AFFEKLTPAQKSAXSEKTPALVDTFVGDQNALLD FGQSAV 
EGVNTTVNH I LSEQKKI Q I PQVDDLLKNANRELNGF I AKYKDAT PAELE KKPNL I QKLFK 
QSKTSLQEFYFDSQNIEQKMDMMAAtTVVKQEDTLARNIVSAEML I EDNTKS IENLVGVI A 
FI ESSQAEAANRASHLQQE I IjALDSQTSEYQI KSNQLARMTEVINTLEQQHTEYVSRLYV 
AWATTPQMP^LVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMMQQS VKSGVTADAI VNAN 
NAALQMLAETSKEAI PMLEKTAQSPTVS I KS VTALAE SLVAQNNG 1 1 AAIDKGRKERAQL 
ESAVI KSAET INDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5219 

STRAIN M781 frame: 2 

FD I DQ I ADNAITKTDKTTE 1 1 SNQTTSQTGQI AFFEKLTPAQKSAI SEKTPALVDTFVGD 
QNALLDFGQSAVEGVNTTVNH I LSEQKKI Q I P QVDDLLKNANRELNGFI AKYKDATPAEL 
EKKPNLI QKLFKQSKTSLQE FYFDSQNI EQKMDMMAANWKQEDTLARN I VSAEML I EDN 
TKS I ENLVGVI AF I ES SQAEAANRASHLQQE I LALDSQTSEYQ I KSNQLARMTEVI NTLE 
QQHTEYVSRLWAWATTPQMRNLVKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMMQQS VK 
SGVTADAI VNANNAALQMLAETSKEAI PMLEKTAQS PTVS I KS VTALAESLVAQNNG 1 1 A 
AI DKGRKERAQLESAVI KSAET I NDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5220 
STRAIN CJB110 frame: 2 

FD I DQI ADNAITKTDKTTE 1 1 SNQTTSQTGQI AFFEKLTPAQKSAI SEKTPALVDTFVGD 
QNALLDFGQSAVEGVNTTVNH I LSEQKKI Q I PQVDDLLKNANRELNGF I AKYKDATPAEL 
EKKPNLI QKLFKQSKTSLQE FYFDSQNI EQKMDMMAANVVKQEDTLARN I VSAEML I EDN 
TKSIENLVGVIAFIESSQAEAANRASHIiQQEILALDSQTSEYQIKSNQLARMTEVINTLE 
QQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTI PTMKLS I AQLGMMQQSVK 
S GVTADA I VN ANNAALQMLAET S KEA I PMLEKTAQ S PTVS I KSVTALAE S LVAQNNG 1 1 A 
AI DKGRKERAQLESAVI KSAET I NDSVKI RDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5221 

STRAIN 1169NT frame: 1 

ADNAITKTDKTTE 1 1 SNQTTSQTGQIAFFEKLTPAQKSAI SEKTPALVDTFVGDQNALLD 
FGQSAVEGVNTTVNH I LSEQKKI QI PQVDDLLKNANRELNGF I AKYKDAT PAELE KKPNL 
I QKLFKQSKTSLQE FYFDSQNI EQKMDMMAANVVKQEDTLARN I VSAEML I EDNTKS I EN 
LVGVIAF I ESSQAEAANRASHLQQE I LALDSQTSEYQ I KSNQLARMTEVINTLEQQHTE Y 
VS RL YVAWATT P QMRNL VKVS SDMRQKLGMLRRNT I PTMKLS I AQLGMMQQS VKSGVTAD 
AI VNANNAALQMLAETSKEAI PMLEKTAQSPTVS I KSVTALAESLVAQNNGI IAAIDKGR 
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Table 52: Comparative Sequences relating to SAG 1823 



KERAQLESAVIKSAETI1TOSVKIRDKKIVEALLNEGKSTQEKVDES 

SEQ ID NO. 5222 

STRAIN JM9130013 frame: 1 

SDTFNFD IDQ I ADNAITKTDKTTE 1 1 SNQTTSQTGQ I AFFEKLTPAQKS AI SEKTPALVD 

TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKI QI PQVDDLLKNANRELNGF I AKYKDA 

TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANWKQEDTIjARNIVSAEM 

LI EDNTKSI ENLVGVIAFI ESSQAEAANRASHLQQEILALDSQTSEYQI KSNQLARMTEV 

INTLEQ^HTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMK^ 

QQS VKSGVTADAI VNANNAALQMLAETSKEAI PMLEKTAQS PTVS I KS VTALAE S LVAQN 

NG 1 1 AAI DKGRKERAQLESAVI KSAET INDS VKI RDKKI VEALLNEGKSTQEKVDES 



SEQ ID NO. 5223 

STRAIN 2603 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEI I SNQTTSQTGQI AFFEKLTPAQKSAI SEKTPALVD 
TFVGDQNALLDFGQS AVEGVNTTVNH I LS EQKKI QI PQVDDLLKNANRELNGF I AKYKDA 
TPAELEKKPNL I QKLFKQSKTSLQEFYFDSQNI EQKMDMMAANWKQEDTLARNI VSAEM 
LI EDNTKS I ENLVGVI AF I E SSQAEAANRASHLQQE I LALDS QTS E YQI KSNQLARMTEV 
I NTLEQQHPE YVSRL YVAWATT PQMRNLVKVS S DMRQKLGMLRRNTI PTMKLS IAQLGMM 
QQS VKSGVTADA I VNANNAALQMLAETSKEAI PMLEKTAQS PTVS I KS VTALAE S LVAQN 
NG 1 1 AAI DKGRKERAQLESAVI KSAET INDS VKI RDKKI VEALLNEGKSTQEKVDES 

PRETTY of : /biotmp/msa2 83 69 . 2 { * } April 22, 2002 04:27 .. 

1 

msa28369 .2{201_090} sdtfnfdidq iadnaitKTD KTTEIISNQT 

msa28369.2(201_1169NT} adnaitKTD KTTEIISNQT 

msa28369.2(201_A909} sdtfnfdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_JM9130013} sdtfnfdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_COHl} KTD KTTEIISNQT 

msa28369 .2{201_CJB110} fdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_M78l} fdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_2603} sdtfnfdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_H36B} sdtfnfdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_18RS2l} fdidq iadnaitKTD KTTEIISNQT 

msa28369.2{201_M732} sdtfnfdidq iadnaitKTD KTTEIISNQT 



TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TcQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 



50 

EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 



*_******** ********** 



51 

msa28369.2{201_090} i SEKTPALVD 

msa2 8369.2(20 1_1 1 6 9 NT } iSEKTPALVD 

msa28369 . 2 {201_A909} iSEKTPALVD 

msa28369 . 2 (201_JM913 0013 } iSEKTPALVD 

msa2 8369.2(20 l_COHl } xS E KTPALVD 

msa28369.2(201_CJB110} iSEKTPALVD 

msa28369.2(201_M78l} iSEKTPALVD 

msa28369.2{201_2603} iSEKTPALVD 

msa28369.2(201_H36B} iSEKTPALVD 

msa28369.2{20 1_1 8RS 2 1 } iSEKTPALVD 

msa28369 .2 (201__M732> iSEKTPALVD 

Consensus _********* 



TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 



DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 



********** ********** 



NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
********** 



100 

QKKIQIPQVD 
QKKTQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
********** 



msa28369.2{201_090} 
msa28369.2{201_1169NT} 
msa28369 .2 {201_A909} 
msa28369 .2 (201__JM9130013 } 
msa28369.2(201_COHl} 
rasa2 8369.2(201_CJB110} 
tnsa28369.2{201_M78l} 
msa28369 . 2 (201^2603 } 
msa28369.2(201_H36B} 
msa28369.2(201_18RS2l} 
rasa28369.2(201_M732} 
Consensus 



101 

DLL KNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
DLLKNANREL 
********** 



NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
NGF I AKYKDA 
********** 



TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
********** 



LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 



150 

TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 



********** ********** 



msa28369.2(201__090 
msa28369 . 2 (201JL169NT 
msa28369.2(201_A909 
msa28369.2{201_JM9130013 
msa2 8 3 6 9 . 2 { 2 0 l__COHl 
msa28369 . 2 {201_CJB110 
msa28369.2(201__M781 
msa28369.2(201_2603 
msa28369.2(201_H3 6B 



151 

QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 



AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 



LARN I VSAEM 
LARNIVSAEM 
LARN I VSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 



LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LIEDNTKSIE 
LI EDNTKS IE 
LIEDNTKSIE 
LIEDNTKSIE 



200 

NLVGVi AF I E 
NLVGViAFIE 
NLVGVxAF I E 
NLVGViAFIE 
NLVGViAFIE 
NLVGViAFIE 
NLVGViAFIE 
NLVGViAFIE 
NLVGViAFIE 
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Table 52: Comparative Sequences relating to SAG 1823 



msa28369 .2{201_18RS2l} 
msa28369 .2 {201__M732} 
Consensus 



QNIEQKMDMM AANWKQEDT LARNIVSAEM LIEDNTKSIE NLVGViAFIE 
QNIEQKMDMM AANWKQEDT LARNIVSAEM LIEDNTKSIE NLVGViAFIE 
********** ********** ********** ********** *****_**** 



msa28369 
msa28369 .2 { 

msa28369 . 
msa28369.2{201 

msa28369 
msa28369 . 2 { 

msa28369 

msa28369 . 

msa28369 . 
msa28369 .2{ 

msa28369 



.2{201_090} 
201_1169NT} 
2{201__A909} 
_JM913 0013} 
2{2 01_COH1} 
201_CJB110} 
2{201_M781} 
2{201__2603} 
2{201_H36B} 
201_18RS2l} 
2{201_M732} 
Consensus 



201 250 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHpE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHpE 
SSQAEAANRA SHLQQEILAL DSQTSEYQIK SNQLARMTEV INTLEQQHtE 
********** ********** ********** ********** ********_* 



msa28369 
msa2 83 69 .2 { 
msa28369 . 
msa28369.2{201 
msa28369 
msa28369 .2{ 
msa283 69 
msa28369 
msa28369 
msa28369 .2{ 
msa2 8 3 69 



2{201_090 
201__1169NT 
2{201_A909 
_JM9130013 
2{201__COH1 
201_CJB110 
2(201_M781 
2{201_2603 
2{201_H36B 
201_18RS21 
2{201_M732 
Consensus 



251 

YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
********** 



TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 



VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 



********** ********** 



MLRRNTI PTM 
MLRRNTIPTM 
MLRRNTI PTM 
MLRRNTIPTM 
MLRRNTI PTM 
MLRRNTIPTM 
MLRRNTIPTM 
MLRRNTIPTM 
MLRRNTIPTM 
MLRRNTIPTM 
MLRRNTIPTM 
********** 



300 

KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
KLSIAQLGMM 
********** 



msa28369 
msa28369.2{ 

msa28369 
msa28369.2{201 

msa28369 
msa28369 .2{ 

msa28369 . 

msa28369 . 

msa283 69 . 
msa28369.2 { 

msa28369 . 



.2{201_090 
201_1169NT 
2{201_A909 
_JM9130013 
2{2 01_COH1 
201 CJB110 



201__M781 
201_2603 
201JH36B 
201_18RS21 
2{201_M732 
Consensus 



301 

QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 
QQSVKSGVTA 



DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 
DAIVNANNAA 



LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 
LQMLAETSKE 



AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 
AIPMLEKTAQ 



350 

SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 
SPTVSIKSVT 



********** ********** ********** ********** ********** 



msa28369.2{201_090} 
msa28369.2{201_1169NT} 
msa28369.2{201_A909} 
msa28369.2{201_JM9130013} 
msa2 8369.2(20 l_COH 1 } 
msa28369 . 2 {201_CJB110 } 
msa28369.2(201_M78l} 
msa28369 .2{201_2603} 
msa28369.2{201_H36B} 
rasa28369 .2 {201_18RS21} 
msa28369 .2{201_M732} 
Consensus 



351 

ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
ALsESLVAQN 
ALaESLVAQN 
ALaESLVAQN 
**_******* 



NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
NGI IAAIDKG 
********** 



RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
RKERAQLESA 
********** 



VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
VIKSAETIND 
********** 



400 

SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
SVKIRDKKIV 
********** 



401 417 

msa28369. 2 {201^090} EALLNEGKST QEKvdes 

msa28369.2{201__1169NT} EALLNEGKST QEKvdes 

msa28369 .2{201_A909} EALLNEGKST QEKvdes 

msa28369.2{201_JM913 0013} EALLNEGKST QEKvdes 

msa28369 .2{2 01_COH1} EALLNEGKST QEKvdes 

tnsa28369.2{201_CJB110} EALLNEGKST QEKvdes 

msa28369 .2{2 01_M78l} EALLNEGKST QEKvdes 

msa28369 .2{201_2603} EALLNEGKST QEKvdes 

msa28369 .2{201_H36B} EALLNEGKST QEKvdes 

msa28369.2{201_18RS2l} EALLNEGKST QEKvdes 

msa28369.2{201_M732} EALLNEGKST QEK 

Consensus ********** *** 
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SEQ ID NO. 5301 
STRAIN 2603 

acaaatactttgaaaaaagaattagttgaagctaaaaagacaattccatc 
cgtaaaagcttcaaaagtaccgcaaaaatcaacatcatcgaaagataaag 
agtttgttcttaaaccgattatcgatgtctctggttggcaacttcctaag 
gagattgattacgatacgctttcaaaaaatatttcaggtgttgttattcg 
tgtct.ttggtggatcaaagatatctaagactaataacgctgcttatacaa 
ctggaatcgataaatcgtttaagacccatatcaaagaatttcaaaagcga 
aatatcccagtagctgtctacagttatgcacttggttcaagtgttaaaga 
aatgaaagaagaggctcagatattttataagaatgcagctccttacaaac 
caactttttattggattgacgtagaagaggagacaatgtctaacatgaat 
aaaggtg t c caagca 1 1 ccgaaaagaat t aaaaagac 1 1 ggtgc t aaaaa 
tgttggtatctacattggtacttactttatgactgagcaaggcatctctg 
taaaaggatttgacgctgtttggattccaacttatggtagcgattctgga 
tactatgaagcggctccgcaaactgaacttaaatacgatttacaccaata 
cacctctcaaggttatctaccaggawtcaatcaaccgcttgatttaaatc 
aaattgcagttaataaagacaagaagaaaacttatgagaaactttttgga 
aaagtaaaagag 

SEQ XD NO. 5302 
STRAIN 090 

AGAAATACTTTGAAAAAAGAATTAG 

TTGAAGCTAAAAAGACAATT C CAT C CGTAAAAG CTTCAAAAGTAC CG CAA 
AAAT CAACAT CATCGAAAGATAAAGAGTT TGTTCTTAAAC CGAT T ATCGA 
TGTCTCTGGTTGGCAACTTCCTAAGGAGATTGATTACGATACGCTTTCAA 
AAAATATTT CAGGTGTTGTTATT CGTGT CTTT GGTGGAT CAAAGATATCT 
AAGACTAATAACG CTG CTTATACAACTGGAAT CGATAAATCGTTTAAGAC 
CCATATCAAAGAATTTCAAAAG CG AAATAT CC CAGTAG CTGTCTACAGTT 
ATGCACTTGGTTCAAGTGTT AAAG AAATG AAAGAAGAGG CT CAGATATTT 
TATAAGAATG GAG CTCCITACAAAC CAACTTITT ATTGGATTGACGTAGA 
AGAGGAGACAATGT CTAACATG AATAAAG GTGTCCAAGCATT CCGAAAAG 
AATTAAAAAGACTTGGTGCTAAAAATGTT GGT AT CTACATTGGTACTTAC 
TTTATGACTGAG CAAGG CAT CT CTGTAAAAGGATTTGACG CTGTTTGGAT 
TC CAACTTATGGTAG CGATT CTGG ATACT ATG AAG CG G CT C CGCAAACTG 
AACTTAAATACGATTTACAC CAAT ACAC CT CT CAAGGTTAT CTAC CAGGA 
TT CAATCAACCG CTTGATTTAAAT CAAATTG CAGTTAAT AAAGACAAGAA 
GAAAACTTATGAGAAACTTTTTGGAAAAGTAAAAGAG 

SEQ ID NO. 5303 
STRAIN A909 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAA 
AGACAATTCCATCCGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCA 
TCGAAAGAT AAAGAGTTTGTT CTTAAAC CGATTAT CG ATGTCTCTGGTTG 
G CAACTT CCT AAGG AGATTGATTACGATACG CTTT CAAAAAATATTT CAG 

GCTG CTTAT ACAACT GGAAT CG ATAAATCGTTTAAGACC CATAT CAAAGA 
ATTT CAAAAG CX3AAATATCCCAGTAGCTGT CT ACAGTTATGCA.CTTGGTT 
CAAGTGTTAAAGAAATGAAAGAAGAGG CT CAGATATTTTATAAGAATG CA 
G CTC (TTTACAAACCAACTTTTTATTGG ATTGACGTAGAAGAGGAGACAAT 
GTCTAACATGAATAAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGAC 
TTGGTGCTAAAAATGTTGGTATCTACATTGGTACTTACTTTATGACTG^ 
CAAGG CATCT CTGTAAAAGGATTTGACG CTGTTTGGATTCCAACTTATGG 
TAGCGATTCTGGATACTATGAAGCGGCTC C G CAAACTGAACTTAAATACG 
ATTTACACCAA'r ACACCTCTCAAG GTT AT CTACCAGGATT CAAT CAACCG 
CTTGATTTAAAT CAAATTG CAGTTAAT AAAGACAAGAAGAAAACTTATG A 
GAAACTTTTTGGAAAAGTAAAAGAG 

SEQ ZD NO. 5304 
STRAIN H36B 

ACAAATACTTTGAAAAAAGAATTAG 

TTGAAG CTAAAAAGACAATT C CAT C CGTAAAAG CTTCAAAAGTACCGCAA 
AAAT CAACAT CAT CGAAAGAT AAAG AGTTTGTTCTTAAAC CGATTAT CG A 
TGTCT CTGGTTGG CAACTTC CTAAG GAG ATTGATTACGATACG CTTT CAA 
AAAAT ATTT CAGGTGTTGTTATTCGTGT CTTT GGTGGAT CAAAG ATAT CT 
AAGACTAATAACGCTGCTITATACAACTGGAATCGATAAATCG 
CCATATCAAAGAATTTCAAAAG CGAAATAT CC CAGTAG CTGT CT ACAGTT 
ATGCACTTGGTTCAAGTGTTAAAGA^TGAAAGAAGAGGCTCAGATATTT 
TATAAGAATGCAGCT CCTT ACAAAC CAACTTT^ 

AGAGGAGACAATGT CTAACATGAAT AAAG GTGT CCAAG CATT C CGAAAAG 
AATTAAAAAGACTTGGTGCTAAAAATGTTGGTATCTACATTGGTACTTAC 
TTTATGACTGAG CAAGG CAT CT CTGTAAAAGGATTTGACG CT GTTTGGAT 
TCCAACrrATGGTAGCGATTCTGGATACTATGAAGCGGCTCCGCAAACTG 
AACTTAAATACGATTTACAC CAAT ACACCT CT CAAGGTTATCTACCAGG A 
TT CAATCAACCG CTTGATTTAAAT CAAATTGCAGTT AATAAAGACAAGAA 
GAAAACTTATGAGAAACTTTTTCGAAAAGTAAAAGAG 

SEQ XD NO. 5305 
STRAIN X8RS2X 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCT AAAAA 
GACAATT CCATCCGTAAAAG CTT CAAAAGTAC CG CAAAAAT CAACAT CAT 
CG AAAGATAAAGAGTTTGTT CTTAAAC CGATT AT CGATGT CTCTGGTTGG 
CAACTT CCTAAGGAGATTG ATT AC G AT ACG CTTT CAAAAAATATTT CAGG 
TGTTGTTATT CGTGT CTTTGGTGG AT CAAAGATAT CT AAGACTAATAACG 
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CTG CTTAT ACAACT GGAAT CGATAAAT CGTTT AAG AC C CAT AT CAAAGAA 

TTTCAAAAGCGAAATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTC 

AAGTGTTAAAG AAATGAAAGAAGAG G CT CAGAT ATTTTATAAGAATG CAG 

CTCCTTACAAACCAACTTTTTATTGGATTGACGTAGAAGAG^ 

TCTAACATGAATAAAGCTGTC(TAAGCATT CCG AAAAGAATTAAAAAGACT 

TGGTGCTAAAAATGTTGGTATCTACAT^GGTACTTACTTTATGACTGAGC 

AAGGCATCTCTGTAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGT 

AGCGATTCTGGATACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGA 

TTTACACCAATACACCrCTCAAGGTTATCTACCAG 

TTGATTT AAAT CAAATT G CAGTTAATAAAG ACAAGAAGAAAACTT ATGAG 
AAACTTTTTGGAAAAGTAAAAGAG 

SEQ ID NO. 5306 
STRAIN M732 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAA 

AAGACAATTC CATC CGTAAAAG CTT CAAAAGT AC C G CAAAAAT CAACAT C 
AT CGAAAGAT AAA.G AGTTTGTT CTTAAA.C CGATTAT CGATGT CTCTGGTT 
GGCAACTTC CTAAGGAG ATTGATT ACGAT ACGCTTT CAAAAAATATTT CA 
GGTGTTGTTATT CGT AT CTTTGGTGGAT CAAAGAT AT CT AAG ACT AATAA 
CG CTG CTTAT ACAACTGGAAT CGATAAAT CGTTTAAGAC C CATATCAAAG 
AATTT CAAAAGCGAAATAT C C CAGTAG CTGT CTACAGTTATG CACTTGGT 
TCAAGTGTTAAAGAAATGAAAGAAGAGGCTCAGATATTTTATAAGAATGC 
AGCTCCTTAGAAaCCAACTTTTTATTGG&^ 

TGTCT AACATGAATAAAGGT GTC CAAG CATTC CG AAAAGAGTTAAAAAGA 
CTTGGTGCTAA^^TGTTGGTATCTACATCGGTACTTACrTT 
GCAAGGTATCTCTGTAAAAGGATTTGACGCTGTTTGGATTCCAACTTATG 
GTAGCGATT CTGGAT ACTATGAAG CAG CT C CACA AACTG A ACTTABATAC 
GATTT ACACCAATACAC CT CTCAAGGTTAT CTAC CAGGATT CAATCAACC 
G CTTGATTT AAATCAAATT G CAGTT AATAAAGAGAA3AAGAAAACTT ATG 
AGAAACTTTTTGGAAAAGTAAAAGAG 

SEQ ID NO. 5307 
STRAIN COH1 

ACAAATACTTTGAAAAAAG AATTAGTTG AAG CTAAAA 

AGACAATT C CATC CGTAAAAGCTT CAAAAGTACCG CAAAAAT CAACAT CA 

TCGAAAGATAAAGAGTTTGTTCITAAACCGATTATCGATGTCTCTGGTTG 

GCAACTTCCTAAG<5AGATTGATTACGATACGCr^ 

GTGTTGTTATTCGTAT CTTTGGTGGAT CAAAGATAT CTAAGACTAATAAC 
GCTG CTTAT ACAACTGGAAT CGATAAAT C GTTTAAGACC CATAT CAAAGA 
ATTTCAAAAGCGAAATATCCCAGTAGCTGTCTACAGTTATGCACTT^^ 
CAAGTGTTAAAGAAATGAAAGAAG AGG CT CAGAT ATTTTATAAGAATGCA 
GCTCCTTACAAACCAACTTTTTATTG^ATTGACGTAGAA 
GT CTAACATGAATAAAGGTGTCCAAG CATT CCGAAAAGAGTTAAAAAGAC 
TTGGTG CTAAAAATGTTGGT AT CTACAT C GGTACTTACTTTATGACTGAG 
CAAGGTAT CT CTGTAAAAGGATTTGACG CTGTTTGGATT CCAACTTATGG 
TAG CGATTCTGG ATACTATGAAG CAG CTC CACAAACTGAACTTAAATACG 
ATTTACACCAATACAC CT CT CAAGGTTAT CTAC CAGG AT T CAAT CAAC C G 
CTTGATTTAAATCAAATTGCAGTTAATAA^ 
GAAACTTTTTGGAAAAGTAAAAGAG 

SEQ ID NO. 5308 
STRAIN M781 

AOUVATACTTTGAAAAAAGAATTAGTTGAAGCTAAA 

AAGACAATT C CATC C GTAAAAG CT'T CAAAAGT AC CGCAAAAAT CAACAT C 
AT CGAAAGATAAAGAGTTTGTTCTTAAACCGATT ATCGATGT CT CTGGTT 
GG CAACTTC CTAAGGAGATTGATTACG ATACG CTTT CAAAAAATATTTCA 
GGTGTTGTTATT CGTATCTTTGGTGGAT CAAAGATAT CTAAGACT AATAA 
CGCTG CTT ATACAACTGGAATCGATAAAT C GTTTAAGAC CCATAT CAAAG 
AATTT CAAAAGCGAAATAT CCCAGTAGCTGTCTACAGTTATGCACT 
TCAAGTGTTAAAGAAATGAAAGAAGAGG CT CAGAT ATTTT AT AAGAATGC 
AGCTCCTTACAAACCAACTTTTTatTGGATTGACGTAGA^ 
TGTCT AACATGAATAAAGGTGT C CAAG CATT C CG AAAAGAGTTAAAAAGA 
CTTGGTG CTAAAAATGTTGGTATCTACAT CC4GTACTTACTTTATGACTG A 
GCAAGGTATCTCTGTAAAAGGATTTGACG CTGTTTGG ATT C CAACTTATG 
GTAG CGATT CTGGATACTATGAAGCAG CT CCACAAACTGAACTTAAATAC 
GATTT ACAC CAATACAC CT CTCAAGGTTAT CTAC CAGGATTCAAT CAAC C 
GCTTGATTTAAATCAAATTG CAGTTAATAAAGACAAGAAGAAAACTTATG 
AGAAACTTTTTGGAAAAGTAAAAGAG 

SEQ ID NO. 5309 
STRAIN CJB110 

AAATACTTTGAAAAAAG AATTAGTTGAAG CT AAAAAG ACAATTCCATCCG 
TAAAAG CTT CAAAAGTAC CG CAAAAAT CAACAT CATC GAAAG AT AAAGAG 
TTTGTTCITAAACCGATTATCGATGTCTCTC^TC 

GATTGATTAC GATACGCTTT CAAAAAATATTT CAGGTGTTGTTATTCGTG 
T CTTTGGTGGAT CAAAG AT ATCTAAGACT AATAACGCTG CTTATACAACT 
GGAATCGATAAATCGTTTAAGACCCATATCAAAGAATT^ 
TATC CCAGTAGCTGT CTACAGTTATG CACTTGCHT CAAGTGTTAAAGAAA 
TGAAAGAAGAGG CT CAGATATTTTATAAG AATG CAG CT CCTTACAAAC CA 
ACTTTTTATTGGATT GACGTAGAAGAGGAG ACAATGT CTAACATG AATAA 
AGGTGTCCAAGCATT C CGAAAAGAATTAAAAAGACTITGGTG CT AAAAATG 
TTGGTAT CTACATT GGTACTTACTTTATGACTGAG CAAGG CATCT CTGT A 
AAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGAT^ 
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CTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATACA 
CCTCTCAAGGTTAT CTACCAGGATTCAATCAACCGCTTGATTTAAATCAA 
ATTACAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGAAA 
AGTAAAAGAG 

SEQ ID NO. 5310 
STRAIN 1169NT 

AGAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCC 
ATCCGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATA 

AAGGAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTAT 
TCGTGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATA 
CAACTGG AATCGATAAATCGTTT AAGAC C CAT AT CAAAG AATTT C AAAAG 
CGAAATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAA 
AG AAATGAAAGAAG AGG CT CAG ATATTTT ATAAG AATG CAG CT C CTTACA 
AACCAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATG 
AATAAAGGTGTC CAAG CATT CCGAAAAGAATTAAAAAGACTTGGCG CTAA 
AAATGTTGGTATCTACATCGGTACTTACTTTATGACTGAGCAAGGTATCT 
CTGTAAAAGGATTTGACG CTGTTTG GATT C CAACTTATGGTAGCG ATTCT 
GGATACT ATGAAG CAGCT CCGCAAACTGAACrTAAATACGATTTACACCA 
ATACAlCCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCITGATTTAA 
AT CAAATTG CAGTTAATAAAGACAAGAAGAAAACTTATG AGAAACTTTTT 
GGAAAAGTAAAAGAG 

SEQ ID NO. 5311 
STRAIN JH9130013 

ACAAATACTTTGAAAAAAGAATTAG 

TTGAAGCTAAAAAGACAATTCCATCCGTAAAAGCTTCAAAAGTACCGCAA 
AAAT CAACAT CATCGAAAGATAAAGAGTTTGTTCTTAAACCG ATT AT CGA 
TGTCT CTGGTTGGCAACTT C CTAAGGAGATTG ATTACGATACGCTTT CAA 
AAAATATTTCAGGTGTTGTTATTCGTGTCTTTGGTGGATCAAAGATATCT 
AAGACrAATAACGCTGCTTATACAACT GG AAT CG ATAAAT CGTTTAAGAC 
CCATATCAA^GAATTT CAAAAG CGAAATAT CC CAGTAG CTGTCTACAGTT 
ATGCACTTGGTT CAAGTGTTAA^GAAATGAAAGAAGAGG CT CAG ATATTT 
TATAAGAATG CAGCT CCTTACAAAC QA^CTTTTTATTGGATTGACGTAG A 
AGAGGAGAGAATGTC^AACATGAATAAAGGTGTC CAAG CATT CCGAAAAG 
AACTAAAAAGACTTGGTGCTAAAAATGTTGGTATCTAC^TTGGTACTTAC 
TTTATGACTGAGCAAGGCATCTCTGTAAAAGGATCTGACGCTGTTTGGAT 
TCCAACTTATGGTAG CG ATT CTGGATACT ATGAAG CGGCT C CG CAAACTG 
AACTTAAATACGATTTACAC CAATACAC CTCTCAAGGTTAT CTACCAGGA 
TT CAAT CAAC CG CTTGATTTAAAT CAAATT G CAGTTAAT APAGACAAGAA 
GAAAACTTATGAGAAACTTTTTGGAAAAGTAAAAGAG 

PRETTY of : /biotmp/msa2 1441 . 2 { * } January 20, 2003 03:46 



msa21441.2{206_090} 
msa21441.2{206_18RS2l} 
msa21441 . 2 { 206_2603} 
msa21441.2f 206_A909} 
ms a2 144 1 . 2 { 206_H3 SB } 
msa2144 1 . 2 { 2 06_JM913 0 0 13 } 
msa21441 . 2 { 206_CJB110 } 
msa2 1441 . 2 ( 206_COH1 } 
msa21441 . 2 {206_M732 } 
msa21441.2{206_M78l} 
msa21441 . 2 {206_1169NT} 
Consensus 



msa21441.2{206_090 
msa2 1441 . 2 { 206_18RS21 
msa21441 . 2 {206_2603 
msa21441 . 2 (206_A909 } 
msa21441.2{206_H36B} 
msa21441.2{206_JM9130013j 
msa21441.2{206_CJB110} 
msa2 144 1.2(20 6_COHl } 
msa21441.2{206_M732} 
msa21441.2{206_M78l} 
msa21441.2{206_1169NT} 
Consensus 



a c AAATACTT 
a c AAATACTT 
acAAATACTT 
acAAATACTT 
acAAATACTT 
acAAATACTT 
— AAATACTT 
acAAATACTT 
acAAATACTT 
acAAATACTT 
acAAATACTT 



TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 
TGAAAAAAGA 



ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 
ATTAGTTGAA 



_******** ********** ********** 



51 

CGTAAAAGCT 
CGT AAAAG CT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
CGTAAAAGCT 
********** 



TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
TCAAAAGTAC 
********** 



CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
CGCAAAAATC 
********** 



GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
GCTAAAAAGA 
********** 



AACAT CATCG 
AACATCATCG 
AACAT CATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
AACATCATCG 
********** 



50 

CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
CAATTCCATC 
********** 

100 

AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
AAAGATAAAG 
********** 



101 150 

msa21441.2{206_090} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2{206_18RS2l} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa2 144 1.2 { 206^2603} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2{206_A909} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa2 144 1 . 2 { 2 06_H3 6B } AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2{206_JM9130013} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2{206_CJB110} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2f 206_COH1} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 

msa21441.2{206_M732} AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 
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msa2 144 1 . 2 { 2 0 6_M7 8 1 } 
msa21441.2{206_1169NT} 
Consensus 



AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 
AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG 
********** ********** ********** ********** ********** 



msa21441 . 2 { 206JD90 } 
msa21441 . 2 { 206_18RS21 } 
msa21441 . 2{206_2603 } 
msa2 144 1 . 2 1 2 06_A909 } 
msa21441.2{206_H36B} 
msa21441.2{206_JM9130013} 
msa21441.2{206_CJB110} 
msa21441 . 2 f 206_COH1 ) 
msa21441 . 2{206_M732 } 
msa2 144 1 . 2 { 2 0 6_M7 8 1 } 
msa21441 . 2{206_1169NT} 
Consensus 



msa21441 . 2 { 206_090 ) 
msa21441.2{206_18RS2l} 
msa2 144 1 . 2 { 2 06_2603 } 
msa21441 . 2 { 206_A909 } 
msa2 144 1 . 2 { 2 06_H36B } 
msa21441.2{206_JM9130013} 
msa21441.2{206_CJB110} 
msa2 144 1 . 2 { 2 0 6_COHl } 
msa21441 . 2 (206_M732 } 
ms a2 1 4 4 1 . 2 { 2 0 6_M7 8 1 } 
msa2 1441 . 2 { 2 0 6_1 1 6 9 NT } 
Consensus 



msa21441.2{206_090} 
msa21441.2{206_18RS2l} 
msa21441 .2 [ 206_2603 J 
msa21441.2{206_A909) 
msa2 144 1 . 2 { 2 06JK3 6B } 
msa21441.2{206_JM9130013 ) 
msa21441.2{206_CJB110} 
msa2 144 1 . 2 { 2 06_COH1 } 
msa21441 . 2 {206_M732 } 
msa21441.2{206_M78l} 
msa21441 . 2{206_1169NT) 
Consensus 



151 

GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 
GAGATTGATT 



msa21441 
msa21441.2{ 

msa21441 . 

msa21441 . 

msa21441 . 
msa21441.2{206 
msa21441.2{ 

msa21441 . 

msa21441 . 

msa21441 . 
msa21441.2{ 



.2{206_090} 
2 06_18RS21} 
2{206_2603) 
2{206_A909} 
2{206_H36B} 
_JM9130013l 
206_CJB110} 
2{206_COH1) 
2{206_M732) 
2{206_M781} 
2 06_1169NT} 
Consensus 



msa21441. 2 {206^090} 
msa21441.2{206_18RS2lJ 
msa21441 . 2 (206__2603 } 
msa21441.2{206_A909} 
msa2 144 1 . 2 { 2 0 6_H3 6B ) 
msa2 144 1 . 2 { 2 06_JM913 0 0 13 } 
msa2 14 4 1 . 2 { 2 0 6_C JB 1 1 0 J 
msa2 144 1 . 2 { 2 0 6_COHl ] 
msa21441.2{206_M732j 
msa21441.2{206JM78l] 
msa21441 . 2 { 206_1169NT) 
Consensus 



msa2 1441.2(20 6_0 9 0 } 
msa21441 . 2 { 206_18RS21 } 
msa21441.2(206_2603 
msa21441 . 2 (206_A909} 
msa2 1441. 2(20 6_H3 6B ) 
msa21441.2{206 - JM913 0013] 
msa21441.2{206_CJB110 
msa2 144 1 . 2 { 2 0 6_C0H1 J 



ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 
ACGATACGCT 



TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 
TTCAAAAAAT 



********** ********** ********** 



ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
ATTTCAGGTG 
********** 



201 

TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT 
*_******** ********** ********** ********** 



251 

CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
CTGGAATCGA 
********** 

301 

AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
AATATCCCAG 
********** 

351 

AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
AATGAAAGAA 
********** 

401 

CAACTTTTTA 
OAACTTTTTA 

CAACTTTTTA 
CAACTTTTTA 
CAACTTTTTA 
CAACTTTTTA 
CAACTTTTTA 
CAACTTTTTA 



TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 
TAAATCGTTT 



AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 
AAGACCCATA 



********** ********** 



TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
TAGCTGTCTA 
********** 



GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
GAGGCTCAGA 
********** 



TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 
TTGGATTGAC 



CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
CAGTTATGCA 
********** 



TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
TCAAAGAATT 
********** 



CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
CTTGGTTCAA 
********** 



TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
TATTTTATAA 
********** 



GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 
GTAGAAGAGG 



GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
GAATGCAGCT 
********** 



AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 



200 

TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
TTGTTATTCG 
********** 

250 

GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
GCTTATACAA 
********** 

300 

TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
TCAAAAGCGA 
********** 

350 

GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
GTGTTAAAGA 
********** 



400 

CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
CCTTACAAAC 
********** 

450 

TAACATGAAT 
TAACATGAAT 
TAACATGAAT 
TAACATGAAT 
TAACATGAAT 
TAACATGAAT 
TAACATGAAT 
TAACATGAAT 



875 



WO 2004/018646 



PCT/US2003/026827 



Table 53: Comparative Sequences relating to SAG 0755 



msa21441 . 2 ( 2 06 JVI732 } 
msa21441 . 2 (206_M781 } 
msa21441 . 2 {206_1169NT} 
Consensus 



CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT 
CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT 
CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT 
********** ********** ********** ********** ********** 



msa21441 
msa21441.2{ 

msa21441. 

msa21441. 

msa21441 . 
msa21441.2{206 
m3a21441.2{ 

msa21441. 

msa21441. 

msa21441. 
msa21441.2{ 



.2{206_090} 
206_18RS2l} 
2{206_2603} 
2(206^909} 
2{206_H36B} 
_JM9130013} 
2 06_CJB110) 
2{206_COH1} 
2{206_M732} 
2{206_M78l} 
206_1169NT} 
Consensus 



451 

AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
AAAGGTGTCC 
********** 



AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 
AAGCATTCCG 



AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAaTTA 
AAAAGAgTTA 
AAAAGAgTTA 
AAAAGAgTTA 
AAAAGAaTTA 



AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 
AAAAGACTTG 



500 

GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GtGCTAAAAA 
GcGCTAAAAA 



********** ******_*** ********** *_******** 



501 

msa2 14 4 1 . 2 { 2 0 6_0 9 0 } TGTTGGTATC 

msa21441.2{206__18RS2l} TGTTGGTATC 

maa21441 . 2 {206_2603 } TGTTGGTATC 

msa2 1441.2(20 6__A9 0 9 } TGTTGGTATC 

msa2 1441.2(20 6_H3 6B } TGTTGGTATC 

ms a2 1441.2(20 6_JM9 1 30013} TGTTGGTATC 

ms a2 14 4 1 . 2 ( 2 0 6_C JB1 1 0 } TGTTGGTATC 

msa2 1441.2(20 6_COHl } TGTTGGTATC 

msa2 1441. 2(20 6_M7 3 2 } TGTTGGTATC 

msa2 1441. 2(20 6_M7 8 1 } TGTTGGTATC 

msa21441.2(206_1169NT} TGTTGGTATC 

Consensus ********** 



TACATtGGTA 
TACATtGGTA 
TACATtGGTA 
TACATtGGTA 
TACATtGGTA 
TACATtGGTA 
TACATtGGTA 
TACATcGGTA 
TACATcGGTA 
TACATcGGTA 
TACATcGGTA 



CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 
CTTACTTTAT 



*****_**** ********** 



GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
GACTGAGCAA 
********** 



550 

GGcATCTCTG 
GGcATCTCTG 
GGcATCTCTG 
GGcATCTCTG 
GGcATCTCTG 
GGcATCTCTG 
GGcATCTCTG 
GGtATCTCTG 
GGtATCTCTG 
GGtATCTCTG 
GGtATCTCTG 
**_******* 



msa21441 
msa21441.2{ 

msa21441 . 

rasa21441. 

msa21441 . 
msa21441.2{206 
msa21441.2( 

msa21441. 

msa21441 . 

msa21441 
msa21441.2{ 



2{206_090} 
206_18RS2l] 
2{206_2603; 
2{206_A909] 
2{206_H36B] 
JM9130013J 
206_CJB110 ! 
2(206_COHi; 
2{206_M732' 
2{206_M78ll 
206_1169NT} 
Consensus 



551 

TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
TAAAAGGATT 
********** 



TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
TGACGCTGTT 
********** 



TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
********** 



CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
CTTATGGTAG 
********** 



600 

CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
CGATTCTGGA 
********** 



msa21441 
msa21441.2{ 

msa21441. 

msa21441 . 

msa21441 . 
msa21441.2{2 06, 
msa21441.2{ 

msa21441 . 

msa21441. 

msa21441. 
msa21441.2{ 



2{206_090} 
206_18RS21} 
2{206_2603} 
2(206_A909} 
2(206_H36B} 
_JM9130013} 
206_CJB110} 
2(206__COHl} 
2(206_M732} 
2{206_M781} 
206_1169NT} 
Consensus 



601 

TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 
TACTATGAAG 



CgGCTCCgCA 
CgGCTCCgCA 
CgGCTCCgCA 
CgGCTCCgCA 
CgGCTCCgCA 
CgGCTCCgCA 
CgGCTCCgCA 
CaGCTCCaCA 
CaGCTCCaCA 
CaGCTCCaCA 
CaGCTCCgCA 



AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 
AACTGAACTT 



AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 
AAATACGATT 



650 

TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 
TACACCAATA 



********** *_*****„** ********** ********** ********** 



651 700 

msa2 1441. 2 {206^090} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_18RS21} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_2603} CACCTCTCAA GGTTATCTAC CAGGAwTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_A909} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_H36B} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441- 2 {206_JM9130013 } CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_CJB110} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_COHl} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

• msa21441.2{206_M732} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_M78l} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

msa21441.2{206_1169NT} CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC 

Consensus ********** ********** *****_**** ********** ********** 



701 

msa21441.2{206_090} AAATTgCAGT 

msa21441.2{206_18RS21> AAATTgCAGT 

msa2 1441. 2 ( 206^2603} AAATTgCAGT 

msa21441.2(206_A909} AAATTgCAGT 

msa21441.2{206_H36B} AAATTgCAGT 

msa21441.2{206_JM9130013 J AAATTgCAGT 

msa21441.2{206_CJB110) AAATTaCAGT 



TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 



AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 



CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 



750 

ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 



876 



WO 2004/018646 
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Table 53: Comparative Sequences relating to SAG 0755 

msa21441 . 2 {2 06_COH1} AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 

msa21441.2{206__M732} AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 

msa21441.2{206_M78l} AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 

msa21441.2{206_1169NT} AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 

Consensus *****-**** ********** ********** ********** *********** 

751 762 
msa21441.2{206_090} AAAGTAAAAG AG 
msa21441.2{206_18RS2l} AAAGTAAAAG AG 
msa21441.2{206_2603} AAAGTAAAAG AG 
msa21441.2{206_A909} AAAGTAAAAG AG 
msa21441.2{206_H36B} AAAGTAAAAG AG 
msa21441.2{206_JM9130013) AAAGTAAAAG AG 
msa21441.2{206_CJB110} AAAGTAAAAG AG 
msa21441.2{206_COHl} AAAGTAAAAG AG 
msa21441.2(206_M732} AAAGTAAAAG AG 
msa21441.2{206_M781} AAAGTAAAAG AG 
msa21441.2{206_1169NT} AAAGTAAAAG AG 
Consensus ********** ** 

SEQ ID NO. 5312 

STRAIN 2603 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEI DYDTLSKN 
I SGWIRWGGSKI SKTNNAAYTTG IDKSFKTHI KEFQKRNI PVAVYS YALGS S VKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GISVKGFDAWIPTYGSDSGYYEAAPQTELKYDLHQYTSC^YLPGXNQPIJ5I^QIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5313 
STRAIN 090 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEI DYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG IDKSFKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQ I AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5314 

STRAIN A909 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEI DYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYS YALGS S VKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQ I AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5315 

STRAIN H36B frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEI DYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5316 

STRAIN 18RS21 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEI DYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5317 

STRAIN M732 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEI DYDTLSKN 
I SGWIRI FGGSKI SKTNNAAYTTG IDKSFKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5318 

STRAIN COH1 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRI FGGSKI SKTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG! YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQI AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5319 

STRAIN M781 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRIFGGSKI SKTNNAAYTTGI DKSFKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 



877 



WO 2004/018646 
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Table 53: Comparative Sequences relating to SAG 0755 



SHQ XD NO. 5320 
STRAIN CJB110 frame: 2 

NTLKKELVEAKKT I P S VKAS KVPQKSTS S KDKEFVLtKP 1 1 D VSGWQLPKE I DYDTLS KN I 
SGWI RVFGGSKI SKTNNAAYTTG I DKSFKTHI KEFQKRNI PVAVYS YALGSS VKEMKEE 
AQ I FYKNAAP YKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I Y IGTYFMTEQG 
I SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQITVNKDK 
KKTYEKLFGKVKE 

SEQ XD NO. 5321 

STRAIN 1169NT frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG I DKSFKTH I KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQIFYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQI AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5322 

STRAIN JM9130013 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRVFGGSKI SKTNNAAYTTG IDKSFKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAP YKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I Y I GT YFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQI AVNKD 
KKKTYEKLFGKVKE 

PRETTY of : /biotmp/msa21641 . 2 { * } January 20, 2003 03:59 



msa2 164 1.2 {206_090} tNTLKKELVE AKKTI PSVKA 

msa21641 . 2 { 206_1169NT} tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_18RS21) tNTLKKELVE AKKTI PSVKA 

msa21641 .2{206_2603) tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_A909} tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_H3 6B} tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_JM913 0013} tNTLKKELVE AKKTI PSVKA 

msa2 164 1 . 2 { 2 06_COH1 } tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_M732) tNTLKKELVE AKKTI PSVKA 

msa21641.2{206_M78l} tNTLKKELVE AKKTI PSVKA 

msa21641 . 2 { 206_CJB110} -NTLKKELVE AKKTI PSVKA 

Consensus -********* ********** 



SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 
SKVPQKSTSS 



KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 
KDKEFVLKPI 



50 

IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 
IDVSGWQLPK 



********** ********** ********** 



51 

msa2164 1.2 {206^0 90} EIDYDTLSKN ISGWIRvFG 

msa21641 . 2 f 206_1169NT) EIDYDTLSKN ISGWIRvFG 

msa21641 . 2 { 206_18RS21 } EIDYDTLSKN ISGWIRvFG 

msa21641.2{206__2603} EIDYDTLSKN ISGWIRvFG 

msa21641 . 2 {206_A909} EIDYDTLSKN ISGWIRvFG 

msa21641.2{206_H36B} EIDYDTLSKN ISGWIRvFG 

msa21641.2{206_JM913 0013} EIDYDTLSKN ISGWIRvFG 

msa21641.2{206_COHl) EIDYDTLSKN ISGWIRiFG 

msa21641.2{206_M732) EIDYDTLSKN ISGWIRiFG 

msa21641.2{206_M78l} EIDYDTLSKN ISGWIRiFG 

msa21641.2{206_CJB110} EIDYDTLSKN ISGWIRvFG 

Consensus ********** *******_** 



GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 
GSKISKTNNA 



AYTTG I DKS F 
AYTTGIDKSF 
AYTTG IDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 
AYTTGIDKSF 



100 

KTHIKEFQKR 
KTHI KEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 
KTHIKEFQKR 



********** ********** ********** 



101 

msa2 1641 . 2 {206_0 90 } NI PVAVYSYA 

msa21641.2{206_1169NT} N I PVAVYSYA 

msa21641.2{206_18RS2l} NI PVAVYSYA 

msa21641.2{206_2603} NI PVAVYSYA 

msa21641.2{206_A909} NI PVAVYSYA 

msa21641.2{206_H36B> NI PVAVYSYA 

msa21641.2{2 06_JM913 0013} NI PVAVYSYA 

, msa2 164 1 . 2 { 206_COH1 } NI PVAVYSYA 

msa21641 .2{206_M732} NI PVAVYSYA 

msa21641.2{206_M78l} NI PVAVYSYA 

msa21641.2{206_CJB110} NIPVAVYSYA 

Consensus ********** 



LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 
LGSSVKEMKE 



EAQI FYKNAA 
EAQIFYKNAA 
EAQI FYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 
EAQIFYKNAA 



********** ********** 



PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
PYKPTFYWID 
********** 



150 

VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
VEEETMSNMN 
********** 



151 200 

msa21641.2{206_090) KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641 . 2 { 206_1169NT} KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641 .2{206_18RS2l} KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641 . 2{206_2603} KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641.2{206_A909) KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641.2{206_H36B} KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641.2{206_JM9130013} KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa2 164 1 . 2 { 2 0 6_COHl } KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ G I SVKGFDAV WIPTYGSDSG 

msa21641.2{206_M732) KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa21641.2{206_M781) KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

msa2 164 1 . 2 { 206_CJB110 } KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GI SVKGFDAV WIPTYGSDSG 

Consensus ********** ********** ********** ********** ********** 



878 



WO 2004/018646 
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msa21641 .2{206__090} 
ms a2 1 6 4 1 . 2 { 2 0 6__1 1 6 9NT } 
msa21641 . 2 { 206_18RS21 } 
msa21641 . 2 {206^2603} 
msa21641.2{206_A909} 
ms a2 1 6 4 1 . 2 { 2 0 6_H3 6B } 
msa21641.2{206_JM9130013} 
ms a2 164 1 . 2 { 2 0 6_COHl } 
msa21641.2{206_M732} 
msa21641.2{206_M78l} 
msa2164i.2{206_CJB110} 
Consensus 



201 

YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 
YYEAAPQTEL 



KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 
KYDLHQYTSQ 



GYLPGfNQPL 
GYLPGf NQPL 
GYLPGfNQPL 
GYLPGxNQPL 
GYLPGfNQPL 
GYLPGfNQPL 
GYLPGfNQPL 
GYLPGfNQPL 
GYLPGfNQPL 
GYLPGfNQPL 
GYLPGfNQPL 



DLNQIaVNKD 
DLNQ I aVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQIaVNKD 
DLNQ I tVNKD 



******************** *****_**** *****_**** 



250 

KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
KKKTYEKLFG 
********** 



msa21641 . 2 { 206_090 } 
msa21641.2{206_1169NT} 
msa21641.2{206_18RS2l} 
msa21641 . 2 { 206_2603 } 
msa21641.2{206_A909} 
msa21641.2{206_H36B} 
msa21641.2{206_JM9130013} 
msa2 164 1 . 2 { 2 06_COH1 } 
msa21641.2f206_M732} 
msa21641 . 2 {206_M78l} 
msa21641 . 2 {206_CJB110 } 
Consensus 



251 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

KVKE 

**** 
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Table 54: Comparative Sequences relating to SAG0949 



SEQ ID NO. 5401 
STRAIN 2603 

TTGACTCACAAAAATATATTATTAACCATTATATTTGGATTATTT 

ATGATTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATAATTGG 
GAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAATACTTTTGTTCCTATG 
GGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTGATATTGATTTAGCTAATGCTGTT 
TTTAAAGAATACGGTATTT CAGTG AAATGG CAG CCTATTAACTGGGATATGAAAG AAACT 
GAACTTAAT AATGGT AAT ATAG AC CTTATT TG GAATGGTTATT CAAAAACGG CAG AACGT 
G CTAAAAAAGT CGCTTTTACAAACC CATATATGAATAAT CAT CAAGTAATTGTTACT AAA 
ACTT CAT CACATATTAATAGTATTAAGGAT ATGAAGG GGAAAAAACTAGGAG CC CAGTCG 
GGTTCATCTGGTTTTGATGCTTTTAACGCTAAACCTGATATTTTA 

GGAAAAG AAG CAGTT CIAAT ACG AT ACTTT CACT CAGG CTTTG ATTGATTTAAAAAATAAC 
CGTATTGATGGTCTTTTGATTGAT G AAGTTTATG CTAACTATTAT TT AAAG CAAG AAGGA 
AATATAAAAGCTTATTATTTTGTTAAAACTGCTTATCAAGGAGAAAATTTTGTAGTAGGA 
G CT CGT AAAGTTGAT CGTAG ACTAATTGAAAAGAT TAACAAAG CTTT CAAACAG CTT CAT 
AAT AAGGGG AGATTT CAAAAAAT CT CTTACAAATGGTTTGGTGAAGATGTTT AT AGT AAA 
GAA 

SEQ ID NO. 5402 

STRAIN 090 
ATTGGGaACATTATC 

AAAAGGAAAAGAAAATTACTAT TGG ATTTGATAATACTTTTGTT CCT ATG 
GGATTTGAAAGC CGT T CTGGTG ACTAt AC CGG CTTTGATATTGATTTAG C 
TAATG CTGTTTTTAAAGAATACGGTATTT CAGTGAAATG G CAG CCTATT A 
ACTGGGATATGAAAGAAACTGAACTTAATAATGGTAATATAGACCTTATT 
TGGAATGGTTATT CAAAAACGGCAGAACGT G CTAAAAAAGTCG CTTTTAC 
AAACC CATATATGAATAAT CAT CAAGTAAT TGTTACTAAAACTT CAT CAC 
AT ATT AATAGTATTAAGGATATGAAGGGGAAAAAACTAGGAG CC CAGT CG 
GGTT CATCTGGTTTTGATG CTTTTAATGCTAAACCTGATATTTTAAAAAA 
GTTTGTAAAAGGAAAAGAAG CAGTT CAATACGATACTTT CACTCAGG CTT 
TGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATTGATGAAGTT 
TATGCTAACTATTATTTAAAGCAAGAAGGAAATATAAAAGCT 
TGTTAAAACTG CTTATCAAGGAGAAAATTTTGTAGTAGGAG CTCG CAAAG 
TTGAT CGTAGACTAATTGAAAAGATTAACAAAG CTTT CAAACAG CTT CAT 
AATAAGG GAAAATTT CAAAAAAT CT CTTACAAATGGTTTGGTG AAGATGT 
TTATAGTAAAGAA 

SEQ ID NO. 5403 

STRAIN A909 
ATTGGG 

aACATTATCAAAAGGAAAAGAAAATTACTATTG^ 

GTTC CTATGGGATTT GAAAGTCGTT OTGGTGACTATACCGGCTTTGATAT 
TGATTTAGCTAATGCTGTTTTTAAAGAATACGGTATTTC^ 
AGCCTATTAACTGGGATAtgAAAGAAACTGAACTTAATAATGGTAATATA 
GACCTTATTTGGAATGGTTATT CAAAAACGG CAGAACGTG CTAAAAAAGT 
CGCTTTTACAAACC CATATATGAATAAT CATCAAGTAATTGTTACTAAAA 
CTTCAT CACATATT AATAGTATTAAGGATATGAA.GGGGAAAAAACTAGGA 
GCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTA^ 

TTTAAAAAAGTTTGTAAAAGGAAAAGAAG CAG t T CAATA CGATACTTT CA 
CT CAGG CITTGATTGATTTAAAAAATAAC CGTATTGATGG 
GATGAAGTTTATG CTAACTATTATTTAAAG CAAGAAGGAAATAT AAAAG C 
TTATT ATTTTGTTAAAACTG CTTAT CAAGGAGAAAATTTTGTAGTAGGAG 
CT CGTAAAGTTGAT CGTAG ACT AATTGAAAAGATTAACAAAG CTTT CAAA 
CAGCTTCATAATAAGGGGAGATTTCAAAAAATCT 
TGAAGATGTTTATAGTAAAGaA 

SEQ ID NO. 5404 

STRAIN H36B 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATT ACT ATTGGATT 
TGATAATACTTTTGTTC CTATGGG ATTTG AAAGT CGTT CTGGTGACTATA 
C CGG CITTGATATTGATTTAG CTAATGCTGTTTTTAAAGAATACGGTATT 
TCAGTGAAATGG CAG CCTATTAACTGGGATATGAAAGAAACT GAACTTAA. 
TAATGGT AATATAGACCTTATTTGGAATGGTTATT CAAAAACGG CAG AAC 
GTGCTAAAAAAGTCG CTTTTACAAAC CCATAT ATG AATAAT CAT CAAGTA 
ATTGTTACTAAAACTTCAT CACAT ATTAATAGTATTAAG G AT ATG AAGG G 
GAAAAAACTAGGAGCCCAGTCGGGTTC^TCTGGTTTTGATGCrTTTAACG 
CTAAACCTGATATTTTAAAAAAGTTTGTAAAAGG AAAAG AAG CAG tTCAA 
TACGATACITTCACTCAGGCTTTGATTGATTTAAAAAATAACCGTAT^ 
TGGTCTTTTGATTGATGAAGTtTATGCTAACTATTATTTAAAGCAAGAAG 
GAAATATAAAAGCTTATTATTTTGTTAAAACTGCTTAT CAAGGAgAAA^T 
TTTGTAGTAGGAG CT CGTAAAGTTGATCGTAGACTAATTG AAAAG ATTAA 
CAAAGCTTTCAAACAGCTTCATAATAAGGGGAGATTTCAA^ 
ACAAATGGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5405 

STRAIN 18RS21 
ATTGGGAACATTA 

TCAAAAGGAAAAGAAAATTACTATTG-GATTTGATAATACITT'TGTTCCTA 
TGGGATTTGAAAGTCGTTCTGGTGACTAtACCGGCTTTGATATTGATTTA 
GCTAATG CTGTTTTTAAAGAATACGGTATTT CAGTGAAATGG CAGCCTAT 
TAACTGGGATATGAAAG AAACTGAACTTAATAATGGTAATAT AGAC CTT A 
TTTGGAATGGTTATT CAAAAACGG CAGAACGTGCTAAAAAAGTCGCTTTT 
ACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAAAACTTCATC 
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ACAT ATT AAT AGTATTAAGG AT ATG AAGG GGAAAAAACT AGG AG C CCAGT 
CGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAACCTGATATTTTAAAA 
AAGTTTGTAAAAGG AAAAG AAG CAGTT CAATACG ATACTTT CACTCAGG C 
TTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATTGATGAAG 
TTTATGCTAACTATTATTTAAAGCAAGAAGGAAATATAAAAGCTTATTAT 
TTTGTTAAAACTG CTTATCAAGGAGAAAATTTTGTAGTAGGAGCTCGTAA 
AGTTGAT CGT AG ACT AATT G AAAAGATTAACAAAG CTTT CAAACAGCTT C 
AT AAT AAGG GGAGATTT CAAAAAAT CT CTTACAAATGGTTTG GTGAAGAT 
GTTTATAGTAAAGAA 

SEQ ID NO. 5406 

STRAIN M732 

ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAA 
TACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCT 
TTGATATTG ATTTAG CT AA.TG CTGTTTTTAAA.GAATACG GTATTT CAGTG 
AAATGGCAG C CTATTAACTG GGATATG AAAGAAACTGAACTTAATAATG G 
TAATATAGAC CTTATTTGGAAT GGTTATT CAAAAACGG CAGAACGTG CTA 
AAAAAGT CG CTTTTACAAA.C CCAT ATATGAATAATCAT CAAGTAATTGTT 
ACTAAAACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAA 
ACTAGGAGCCC^GTCGGGTTCATCTGGTTTTGATGCrTTTAACGCTAAAC 
CTGATATTTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGTTCAATACGAT 
ACTTT CACT CAGGCTTTGATTGATTTAAAAAATAAC CGTATTGATGGT CT 
TTTG ATTGATGAAGTTT ATG CTAACTATTATTTAAAGCAAGAA 
TAAAAGCTT ATTATTTTGTTAAAACTG CTTATCAAGG AGAAAATTTTGTA 
GTAGGAG CT CGTAAAGTTGATCGTAGACTAACTG AAAAG ATTAACAAAG C 
TTT CAAACAG CTT CATAATAAGGGGAG ATTTCAAAAAAT CTCTTACAAAT 
GGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5407 

STRAIN COH1 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATTACTATTGGATTTGATAA 
TACTTTTGTTCCTATGGGATCTGAAAGTCGTTCTGGTGACTATACCGGCT 
TTGATACTGATTTAGCTAATGCTGTIT'1T^\AAGAATACX3GTATTT CAGTG 
AAATGG CAG CCTACTAACTGGGAT ATGAAAGAAACTGAACTTAAT AATGG 
TAATATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTA 
AAAAACTCG CTTTTACAAAC CCAT ATATGAATAAT CATCAAGTAATTGTT 
ACTAAAACTT CAT CACATATTAATAGTATTAAGGATATGAAGGGG AAAAA 
ACTAGGAGCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAAC 
CTGATATTTTAAAAA^GTTTGTAAAAGGAAAAGAAGCAGTTCAATAC 
ACTT T CACT CAGGCTCTGATTG ATTTAAAAAATAAC CGTATTGATGGTCT 
TTTGACTGATGAAGTTTATGCTAACTACTACTTAAAGCAAGAAGGAAATA 
TAAAAGCTT ATTATTTTGTTAAAACTG CTTAT CAAGGAGAAAATTTTGTA 
GTAGGAG CTCGTAAAGTTGAT CGTAGACTAATTGAAAAGATTAACAAAG C 
TTTCAAACAGCTT CATAATAAGGGGAG ATTTCAAAAAA 
GGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5408 

STRAIN M781 

ATTGGGAACATTATCAAAAGGAAAAGAAAATTA 

ATACTTTTGTTC CTATGGGATTTGAAAGT CGTTCTGGTGACTATAC CGG C 
TTTGATATTGATTTAGCTAATG CTGTTTTTAAAGAATACGGTATTT CAGT 
GAAATGGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATG 
GTAATATAGACCTTATTTGGAATGGTTATT CAAAAACGG CAGAACGTG CT 
AAAAAAGT CG CTTTTACAAA.CCCATATATGAATAATCAT C AAGTAATTGT 
TACTAAAACTT CATCACATATTAATAGTATTAAGG ATATGAAGGGGAAAA 
AACTAGGAG CCCAGT CGGGTTCAT CTG GTTTTGATGCTTTTAACGCTAAA 
C CTGATATTTTAAAAAAGTTTGTAAAAGGAAAAGAAG CAGTT CAATACG A 
TACTTT CACT CAGG CTTTGATTGATCTAAAAAATAAC CGTATTG ATGGT C 
TTTTGATTGATGAAGTTTAT G CTAACTATTATTTAAAG CAAGAAGGAAAT 
ATAAAAGCCTATTATTTTGTTA Z VAACTGCTTATCAAGGAGAAAAT 
AGTAGGAGCT CGTAAAGTTGAT CGTAGACTAATTGAAAAGATTAACAAAG 
CTTT CAAACAG CTT CATAATAAG^GGAGATTT CAAAAAAT CT CTT ACAAA 
TGGTTTGGTGAAGATGTTTATAGTAAAGaA 

SEQ ID NO. 5409 

STRAIN CJB110 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATTACTATTGGATTTG ATAAT 
ACTTTTGTTC CTATGGGATTTGAAAGTCGCTCTGGTGACT CTT 
TGATATTGATTTAG CTAATG CTGTTTTTAAAGAATACGGT ATTTCAGTGA 
AATGGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATGGT 
AATATAGAC CTT ATTTGGAATGGCT ATTCAAAAACGG CAGAACGTGCTAA 
AAAAGT CGCTTTTACAAAC C CATATATGAATAATCAT CAAGTAATTGTT A 
CTAAAACCTCATCACATACTAATAGTACTAAGGATATGAAGGGGAAAAAA 
CTAGGAGCCCAGTCGGGTTCATCTGGTTTTGA 

TGAT ATTTTAAAAAAGTTTGTAAAAGGAAAAGAAG CAGTT CAAT ACGATA 
CTTT CACTCAGG CTCTGACTGATTTAAAAAATAAC CGTATTGATGGTCTT 
TTGATTGATGAAGTTTATGCTAACTATTATTT^ 

AAAAGCTT ATTATTTTGTTAAAACTG CTTATCAAGGAGAAAATTTTGTAG 
TAGG AGCTCGTAAAGTT GAT CGTAG ACTAACTGAAAAGATTAACAAAG CT 
TT CAAACAG CTT CATAATAAGGGGAGATTT CAAAAAATCTCTTACAAATG 
GTTT GGTGAAGATGTTTAT AGT AAAGAA 

SEQ ID NO. 5410 
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STRAIN 1169NT 

ATTG GGAACATTAT C AAAAG GAAAAGAAAATT AC TATTGG ATTTGAT AA 
TACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCT 
TTGAT ATTGATTT AG CTAATGCTGTTTTTAAAGAATACGGT ATTT CAGTG 
AAATGG CAG C CT ATTAACTG GGAT ATG AAAGAAACTGAACTCAAT AATGG 
TAATATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTA 
AAAAAGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTT 
ACTAAAACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAA 
ACTAGGAGCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTAATGCTAAAC 
CTGACATTTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGTTCAATACGAT 
ACITTCACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCT 
TTTG ATTGATGAAGTTT ATG CTAACTATTATTTAAAG CAAGAAGGAAATA 
TAAAAG CTTATTATTTTGTT AAAACTG CTT ATCAAGGAGAAAATTTTGTA 
. GTAGG AG CT CGCAAAGTTGATCGTAGACTAATTG AAAAG ACT AACAAAG C 
TTTCAAACAGCTTCATAATAAGGGGAAATTTCAAAAAAT CTCTTACAAAT 
GGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ XD NO. 5411 

STRAIN JM913 0013 

ATTGGGAACATTAT C 

AAAAGGAAAAGAAAATTACTATTGGATTTGATAATACTTTTGTTCCTATG 

GGATTTGAAAGTCGCTCTGGTGACTAtACCGGCTTTGATATTGATTTAGC 

TAATG CTGCTTTTAAAGAATACGGT ATTT CAGTGAAATGG CAGC CTATTA 

ACTGGGATATGAAAGAAACTGAACTTAATAATGGTAATATAGACCTTATT 

TGGAATGGTTATTCAAAAACGGCAGAACGTGCTAAAAAAGTC 

AAAC C CATATATGAATAAT CAT CAAGTAATTGTTACTAAAACTTCATCAC 

ATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAGGAGCCCAGTCG 

GGTT CAT CTGGTTTTGATGCTTTTAACG CTAAAC CTGATATriT AAAAAA 

GTTTGTAAAAGGAAAAGAAG CAGTTCAATACGATACTTTCACTCAGG CTT 

TGATTGATCTAAAAAATAACCGTATTGATGGTCTTTTGATTC 

TATG CTAACTATTATTTAAAG CAAGAAGGAAATAT AAAAG CTTATTATTT 

TGTTAAAACTGCTTATCAAGGAGAAAATTTTGTAGTAGGAG CTCGTAAAG 

TTGAT CGTAGACTAATTGAAAAGATTAACAAAGCTTT CAAACAG CTT CAT 

AATAAGGGGAGATTTCAAAAAATCTCTTACAAATGGTTTGGTGAAGATGT 

TTATAGTAAAGAA 

PRETTY of : /biotmp/msa39314 . 2 { * } February 18, 2003 11:01 .. 

1 50 

msa39314.2{225_18RS2l} 

msa3 9314.2{225_2603} ttgactcaca aaaatatatt attaaccatt atatttggat tatttatgat 

msa39314.2{225_A909) 

msa39314.2{225_CJB110} 

msa3 9314.2 { 22 5_COHl } 

msa3 9314.2{225_H36B} 

msa39314 . 2 {225_KM9130013 } 

msa3 9314.2{225_M732} 

msa39314 . 2 {225__M78l} 

msa39314.2{225_090} 

msa39314.2{225 1169NT} — 

Consensus ********** ********** ********** ********** ********** 

51 100 

msa39314.2{225_18RS2l) 

msa39314.2{225_2603} tatattatca gcatgtggta tgtctaataa ggaaatggct ggtattgata 

msa39314.2{225_A909} ' 

msa3 93 14 . 2 { 22 5__CJB110 } 

msa3 9314.2(22 5_C0H1 ) 

msa39314.2{225_H36B) 

rasa39314 . 2 {225JKM9130013 } 

msa39314 . 2 { 225_M732 ) 

msa39314.2{225_M78l} 

msa39314.2{225_090} 

msa3 9314 . 2 { 225_1169NT} 

Consensus ********** ********** ********** ********** ********** 

101 ISO 

msa39314.2{225_18RS2l} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314 .2{225_2603 } ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_A909) ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_CJB110} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_COHl} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_H36B} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_KM9130013) ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225_M732} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTT GATAAT 

msa39314.2{225_M78l} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa3 93 14 .2 {225^090} ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

msa39314.2{225JL169NT) ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT 

Consensus ********** ********** ********** ********** ********** 



msa39314.2{225_18RS2l} 
msa39314 . 2 {225_2603 } 



151 200 
ACTTTTGTTC CTATGGGATT TGAAAG t CGT TCTGGTGACT ATACCGGCTT 
ACTTTTGTTC CTATGGGATT TGAAAG t CGT TCTGGTGACT ATACCGGCTT 
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msa3 9314 . 2{225_A909} ACTTTTGTTC CTATGGGATT 

msa39314.2{225_CJB110} ACTTTTGTTC CTATGGGATT 

msa3 9314.2{225_COHl} ACTTTTGTTC CTATGGGATT 

msa39314.2{225_H3 6B} ACTTTTGTTC CTATGGGATT 

msa39314.2{225_KM9130013} ACTTTTGTTC CTATGGGATT 

msa3 93X4.2{225_M732} ACTTTTGTTC CTATGGGATT 

msa3 9314.2{225__M78l} ACTTTTGTTC CTATGGGATT 

msa3 93 14. 2 {225^090} ACTTTTGTTC CTATGGGATT 

msa39314.2{225_1169NT} ACTTTTGTTC CTATGGGATT 

Consensus ********** ********** 



TGAAAG tCGT 
TGAAAG t CGT 
TGAAAG t CGT 
TGAAAG t CGT 
TGAAAG t CGT 
TGAAAG t CGT 
TGAAAG t CGT 
TGAAAGc CGT 
TGAAAG t CGT 



TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 
TCTGGTGACT 



******_*** ********** 



ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
ATACCGGCTT 
********** 



msa39314.2{ 
msa3 9314. 
msa39314 . 

msa39314.2{ 
msa3 9314 
msa3 9314 
msa39314.2{225 
msa39314 . 
msa39314 
msa39314 

msa39314.2{ 



225_18RS2l} 
2{225_2603} 
2{225_A909} 
225_CJB110} 
2{225_C0H1) 
2{225_H36B} 
_KM9130013} 
2(225_M732} 
2{225_M78l} 
2{225_090} 
225_1169NT} 
Consensus 



201 

TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
TGATATTGAT 
********** 



TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
TTAGCTAATG 
********** 



CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
CTGTTTTTAA 
********** 



AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
AGAATACGGT 
********** 



250 

ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
ATTTCAGTGA 
********** 



msa393 14 . 2 { 225_18RS21 } 
msa3 9314 . 2 { 225_2603 } 
msa39314 . 2 {225_A909 } 
msa39314.2{225_CJB110} 
msa3 9314.2(22 5_COHl } 
ms a3 9 3 1 4 . 2 { 2 2 5_H3 6B } 
msa39314 . 2{225_KM9130013 } 
msa39314 . 2 {225_M732 } 
msa39314 . 2 {225_M781 } 
msa39314 . 2 { 225_090 } 
msa39314 . 2 { 225_1169NT} 
Consensus 



251 

AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 
AATGGCAGCC 



TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 
TATTAACTGG 



********** ********** 



GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
GATATGAAAG 
********** 



AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
AAACTGAACT 
********** 



300 

tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
tAATAATGGT 
cAATAATGGT 
_********* 



301 350 

msa3 9314.2{225_18RS2l} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_2603} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_A909) AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_CJB110} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

rasa39314.2{225_COHl} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_H36B} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_KM9130013} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa3 9314.2{225_M732} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa3 9314.2{225_M78l} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314. 2 {225^090} AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA 

msa39314.2{225_1169NT} AATATAGACC TTATTTGGAA ' TGGTTATTCA AAAACGGCAG AACGTGCTAA 

Consensus ********** ********** ********** ********** ********** 



351 

msa3 9314 .2{225_18RS2l) AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_2603) AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_A909} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_CJB110} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_COHl} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_H36B} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_KM9130013} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa3 93 14. 2 {22 5^732} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_M78l} AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225_090) AAAAGTCGCT TTTACAAACC CATATATGAA 

msa39314.2{225__1169NT} AAAAGTCGCT TTTACAAACC CATATATGAA 

Consensus ********** ********** ********** 



TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 
TAATCATCAA 



400 

GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 



********** ********** 



msa3 93 14 . 2 { 225_18RS2 1 } 
msa3 9314 . 2 ( 225_2603 } 
msa39314 . 2 { 225_A909 } 
msa39314.2{225_CJB110} 
msa39314 .2 {225_COHl} 
tnsa39314.2{225_H36B^ 
msa39314-2{225_KM9130013 
msa3 9314 . 2 { 225_M732 ) 
msa39314 . 2 {225_M78l} 
msa39314.2{225_090j 
msa39314.2{225_1169NT} 
Consensus 



401 

CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
CTAAAACTTC 
********** 



ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
ATCACATATT 
********** 



AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
AATAGTATTA 
********** 



AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
AGGATATGAA 
********** 



450 

GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
GGGGAAAAAA 
********** 



msa39314.2{225_18RS2l} 



451 • 500 

CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC 



883 



WO 2004/018646 



PCT/US2003/026827 



Table 54: Comparative Sequences relating to SAG0949 



msa39314. 

msa39314 . 
msa39314.2{ 

msa39314 . 

msa39314 . 
msa39314.2{22S 

msa39314 

msa3 9314 
msa39314 
msa39314.2{ 



2{225_2603} 
2{225_A909} 
225_CJB110} 
2{225_COHlj 
2{225_H36B} 
_KM9130013} 
2{225_M732} 
2{225_M781} 
2{225_090} 
225_1169NT} 
Consensus 



CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
CTAGGAGCCC 
********** 



AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
AGTCGGGTTC 
********** 



ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 
ATCTGGTTTT 



GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 
GATGCTTTTA 



********** ********** 



AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AcGCTAAACC 
AtGCTAAACC 
AtGCTAAACC 
*_******** 



msa39314.2{ 

msa39314 . 

msa39314 . 
msa39314.2{ 

msa3 9314 . 

msa39314 . 
msa39314.2{225 

msa39314 . 

msa3 9314. 
msa39314 
msa39314.2{ 



225_18RS2l} 
2 {225^2603} 
2{225_A909} 
225_CJB110} 
2{225_COHl} 
2{225_H36B} 
_KM9130013} 
2{225_M732} 
2{225_M781} 
.2{225_090} 
225_1169NT} 
Consensus 



501 

TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAtATTTTA 
TGAcATTTTA 
***_****** 



AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
AAAAAGTTTG 
********** 



TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
TAAAAGGAAA 
********** 



AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
AGAAGCAGTT 
********** 



550 

CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
CAATACGATA 
********** 



551 600 

maa3 9314 .2 {225_18RS2l} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAAC CGTAT TGATGGTCTT 

msa39314.2f 225_2603) CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314.2{225_A909} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314.2{225_CJB110} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa3 9314.2f 225_C0H1) CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314.2{225_H36B} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314.2{225_KM9130013} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa3 9314.2{225_M732} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa3 9314 .2 {225_M781} CTTTCACTCA GG CTTT GATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314 .2{225_090} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

msa39314 .2{225_1169NT} CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT 

Consensus ********** ********** ********** ********** ********** 



601 

msa39314.2{225_18RS2l} TTGATTGATG 

msa3 9314. 2 {225^2603} TTGATTGATG 

msa39314 ,2{225_A909} TTGATTGATG 

msa3 9314-2{225_CJB110} TTGATTGATG 

rasa39314.2{22 5_COHl } TTGATTGATG 

msa3 9314 .2 { 225_H3 6B } TTGATTGATG 

msa39314.2{225_KM9130013} TTGATTGATG 

msa39314.2{225_M732} TTGATTGATG 

msa39314.2{225_M78l} TTGATTGATG 

msa39314.2{225_090) TTGATTGATG 

msa3 9314. 2(22 5_1 1 6 9NT } TTGATTGATG 

Consensus ********** 



AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 
AAGTTTATGC 



TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 
TAACTATTAT 



TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 
TTAAAGCAAG 



650 

AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 
AAGGAAATAT 



********** ********** ********** ********** 



msa39314.2{ 
msa39314 . 
tnsa39314 . 

msa3 9314.2{ 
msa39314 . 
msa39314 . 
msa39314.2{225 
msa39314 
msa39314 
msa39314 

msa3 9314.2{ 



225_18RS2l} 
2{225__2603} 
2{225_A909} 
225_CJB110} 
2{225_C0Hl} 
2{225_H36B} 
_KM9130013} 
2{225_M732} 
2{225_M781} 
.2 {225^090} 
225_1169NT} 
Consensus 



651 

AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
AAAAGCTTAT 
********** 



TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
TATTTTGTTA 
********** 



AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
AAACTGCTTA 
********** 



TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
TCAAGGAGAA 
********** 



700 

AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
AATTTTGTAG 
********** 



msa3 9314.2{ 
msa39314 . 
msa39314 . 
msa39314.2{ 
msa39314 
msa39314 
msa39314.2{225 
msa39314 . 
msa39314 
msa39314 
- msa39314.2{ 



225_18RS21) 
2{225_2603} 
2{225_A909} 
225_CJB110} 
2{225_COHl) 
2{225_H36B} 
_KM9130013} 
2{225_M732} 
2{225_M78lJ 
.2{225_090} 
225_1169NT} 
Consensus 



701 

TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
TAGGAGCTCG 
********** 



tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
tAAAGTTGAT 
CAAAGTTGAT 
cAAAGTTGAT 
_********* 



CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
CGTAGACTAA 
********** 



TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 
TTGAAAAGAT 



750 

TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 
TAACAAAGCT 



********** ********** 



751 



800 



884 



WO 2004/018646 



PCT/US2003/026827 



Table 54: Comparative Sequences relating to SAG0949 



msa39314 .2 {225_18RS2l} TTCAAACAGC TTCATAATAA 

msa3 93 14. 2 {225^2603} TTCAAACAGC TTCATAATAA 

msa39314.2{225_A909} TTCAAACAGC TTCATAATAA 

msa39314.2{225_CJB110} TTCAAACAGC TTCATAATAA 

msa39314 . 2{225__COHl) TTCAAACAGC TTCATAATAA 

msa39314 .2{225__H3 6B} TTCAAACAGC TTCATAATAA 

msa39314 . 2 { 225_KM9130013 } TTCAAACAGC TTCATAATAA 

msa39314.2{225_M732} TTCAAACAGC TTCATAATAA 

msa39314 .2{225_M78lJ TTCAAACAGC TTCATAATAA 

msa39314 .2{225_090) TTCAAACAGC TTCATAATAA 

msa39314 .2{225_1169NT} TTCAAACAGC TTCATAATAA 

Consensus ********** ********** 



GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGaAaATTT 
GGGgAaATTT 



CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 



CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 



***_*-**** ********** ********** 



801 

msa39314 . 2 { 225_18RS2l} GTTTGGTGAA 

msa39314.2{225_2603} GTTTGGTGAA 

msa39314.2{225_A909} GTTTGGTGAA 

msa39314.2{225_CJB110} GTTTGGTGAA 

msa3 9314.2(22 5_COHl } GTTTGGTGAA 

maa39314.2{225_H36B} GTTTGGTGAA 

msa39314.2{225_KM9130013} GTTTGGTGAA 

msa39314 .2{225_M732} GTTTGGTGAA 

msa39314.2{225_M78l} GTTTGGTGAA 

msa393 14 - 2 { 225_090 } GTTTGGTGAA 

msa39314.2{22S_1169NT} GTTTGGTGAA 

Consensus ********** 



GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 



828 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 
GTAAAGAA 



********** ******** 



SEQ ID NO. 5412 

STRAIN 2603 frame: 1 

LTHKN I LLT I IFGLFMI I LSACGMSNKEMAG I DNWEHYQKEKKIT I GFDNTFVPMGFESR 
SGDYTGFD I DItANAVFKB YG I SVKWQP INWDMKETELNNGNI DL I WNGYS KTAERAKKVA 
FTNPYMNNHQVI VTKTSSHINS IKDMKGKKLGAQSGSSGFDAFNAKPDILKKFVKGKEAV 
QYDTFTQAL I DLKNNRI DGLLIDE VYANYYLKQEGNI KAYYFVKTAYQGENFWGARKVD 
RRLI EKINKAFKQLHNKGRFQKI S YKWFGEDVYS KE 



SEQ ID NO. 5413 

STRAIN 090 frame: 3 

WEHYQKEKKI T I GFDNTFVPMGFE SRSGD YTGFD I DLANAVFKE YG I S VKWQP I NWDMKE 
TE LNNGN I DL I WNG Y S KT AE RAKKVAFTN P YMNNHQV I VT KT S S H I NS I KDM KG KKLGAQ 
SGS SGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DLKNNRI DGLL I DE VYANYYLKQE 
GN I KAYY FVKT AYQGENFVVGARKVDRRL I EKINKAFKQLHNKGKFQKI SYKWFGEDVYS 
KE 



SEQ ID NO. 5414 

STRAIN A909 frame: 3 

WEHYQKEKKI T I GFDNTFVPMGFESRSGDYTGFD I DLANAVFKE YG I S VKWQP I NWDMKE 
TELNNGNIDL I WNG Y S KTAE RAKKVAFTN P YMNNHQV I VTKT S SH INS I KDMKGKKLGAQ 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL IDLKNNRI DGLL I DEVYANYYLKQE 
GN I KAYY FVKTAYQGEN FVVGARKVDRRL I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5415 

STRAIN H36B frame: 3 

WEHYQKEKKITI GFDNTFVPMGFESRSGDYTGFD IDLANAVFKEYGI SVKWQP INWDMKE 
TELNNGNIDLI WNGYSKTAERAKKVAFTNPYMNNHQVI VTKTSSHINS I KDMKGKKLGAQ 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DLKNNRIDGLL I DEVYANYYLKQE 
GNI KAYY FVKTAYQGEN FVVGARKVDRRL I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5416 

STRAIN 18RS21 frame: 3 

WEHYQKEKKITI GFDNTFVPMGFESRSGDYTGFD IDLANAVFKEYGI SVKWQP I NWDMKE 
TELNNGNIDL I WNGYSKTAERAKKVAFTNPYMNNHQVI VTKTSSHINS I KDMKGKKLGAQ 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQALI DLKNNRI DGLL I DEVYANYYLKQE 
GN I KAYY FVKTAYQGEN FVVGARKVDRRL I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5417 

STRAIN M732 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAWKEYGISVKWQPINWDMKE 
TEIjNNGNIDLI WNGYSKTAERAKKVAFTNPYMNNHQVI WKTSSHINS I KD 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DLKNNRIDGLL I DEVYANYYLKQE 
GN I KAYYFVKTAYQGENFVVGARKVDRRL I EKI NKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5418 

STRAIN COH1 frame: 3 

WEHYQKEKKI T I GFDNT FVPMGFES RSGDYTG FDI DLANAVFKE YGI SVKWQP I NWDMKE 
TELNNGNIDL I WNGYSKTAERAKKVAFTNPYMNNHQVI VTKT 

SGSSG FD AFNAKPD I LKKFVKGKEAVQ YDT FTQAL I DLKNNR I DGLL ID E VYANYYLKQE 
GN I KAYYFVKTAYQGENFVVGARKVDRRL I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 
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Table 54: Comparative Sequences relating to SAG0949 



SEQ ID NO. 5419 

STRAIN M781 frame: 3 

WEHYQKEKKI T I GFDNTFVPMGFE S RSGD YTGFD I DIiANAVFKE YG I S VKWQ P I NWDMKE 
TELNNGN IDL I WNGYSKTAERAKKVAFTNPYMNNHQVI VTKTSSHINS I KDMKGKKLGAQ 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DLKNNRIDGLL IDEVYANYYLKQE 
GN I KAYYFVKTAYQGENFVVGARKVDRRL I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5420 

STRAIN CJB110 frame: 3 

WEHYQKEKKI TI GFDNTFVPMG FESRSGD YTGFD I DLANAVFKEYGI S VKWQPINWDMKE 
TELNNGNIDL I WNGYSKTAERAKKVAFTNPYMNNHQVI VTKTSSHINS I KDMKGKKLGAQ 
SGS S G FDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DIjKNNR I DGLL I DEVYANYYLKQE 
GNIKAYYFVKTAYQGENFVVGARKVDRRIi I EKINKAFKQLHNKGRFQKI SYKWFGEDVYS 

KE 

SEQ ID NO. 5421 

STRAIN 1169NT frame: 3 

WEHYQKEKKITI GFDNTFVPMGFE SRSGDYTGFD I DLANAVFKEYGI SVKWQPINWDMKE 
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGA 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDT FTQAL I DLKNNRI DGLL I DEVYANYYLKQE 
GNI KAYYFVKTAYQGENFWGARKVDRRL I EKINKAFKQLHNKGKFQKI SYKWFGEDVYS 
KE 

SEQ ID NO. 5422 

STRAIN JM9130013 frame: 3 

WEHYQKEKKI TI GFDNTFVPMGFE S RSGD YTGFD I DLANAVFKEYGI SVKWQP I NWDMKE 
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVI VTKTSSHINS I KDMKGKKLGAQ 
SGSSGFDAFNAKPD I LKKFVKGKEAVQYDTFTQAL I DLKNNR IDGLL I DEVYANYYLKQE 
GNI KAYYFVKTAYQGENFVVGARKVDRRL I EKI NKAFKQLHNKGRFQKI SYKWFGEDVYS 



PRETTY of: /biotmp/msa45901 . 2 { *} February 19, 2003 03:09 



msa45901.2{225_090} 
msa45901.2{225_1169NT} 
msa45901.2{225_18RS2l} 
msa45901 . 2 { 225_2603 } 
msa45901 . 2 {225_A909 } 
msa45901.2{225_CJB110} 
msa4 5901.2(22 5_COHl } 
msa45901.2{225_H36B} 
msa45901 . 2 (225_JM9130013 } 
msa45901 . 2 {225_M732 } 
msa45901.2{225_M78l) 
Consensus 



1 50 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

lthknillti ifglfmiils acgmsnkema gidnWEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

********** ********** ********** ********** ********** 



msa45901 . 2 { 225_090 J 
msa45901.2{225_1169NT} 
msa45901.2{225_18RS2l} 
msa45901 . 2 (225_2603 } 
msa45901.2{225_A909> 
msa45901.2{225_CJB110} 
msa45901 . 2 { 225_C0H1 } 
msa45901.2{225_H36B} 
msa45901 . 2 {225_JM9130013 } 
msa45901.2(225_M732} 
msa45901.2{225_M78l} 
Consensus 



51 

TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 
TFVPMGFESR 



SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 



LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 



ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW. 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 



100 

DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 



********** ********** ********** ********** ********** 



msa45901.2{225_090; 
msa45901.2{225_1169NT; 
msa45901.2{225_18RS2i; 
msa45901 . 2 {225_2603 
msa45901.2{225_A909 
msa45901.2{225_CJB110 
msa45901 . 2{225_COHl 
msa45901 . 2 {225__H36B 
msa45901.2{225_JM913 0013 
msa45901.2(225_M732 
msa45901.2{225_M781 
Consensus 



msa45901 . 2 (225_090 ) 
msa45901.2{225_1169NT> 
msa45901.2{225_18RS2l} 
msa45901 . 2f 225_2603 \ 
msa45901 . 2 {225__A909 } 
msa45901.2{225__CJB110} 



101 

NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
********** 

151 

LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 



KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
KTAERAKKVA 
********** 



DAFNAKPD I L 
DAFNAKPDIL 
DAFNAKPD I L 
DAFNAKPDIL 
DAFNAKPDIL 
DAFNAKPDIL 



FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 



VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 



********** ********** 



KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 



QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 



150 

NS I KDMKGKK 
NSIKDMKGKK 
NS I KDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
********** 

200 

DLKNNR I DGL 
DLKNNR I DGL 
DLKNNRI DGL 
DLKNNRI DGL 
DLKNNR I DGL 
DLKNNR I DGL 
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Table 54: Comparative Sequences relating to SAG0949 



301.2f225_COHl) 
301.2{225_H 



msa4590J _ 
msa45901 . 2 {225_H3 6B} 
msa45901 . 2 { 225_JM9130013 } 
msa45901 . 2 (225J4732 } 
msa45901.2{225_M78l} 
Consensus 



LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
********** 



DAFNAKPD I L 
DAFNAKPD I L 
DAFNAKPD I L 
DAFNAKPD I L 
DAFNAKPD IL 
********** 



KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 



QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 



DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 



********** ********** ********** 



msa45901.2{225_090} 
msa45901.2{225_1169NT} 
msa45901.2{225_18RS2l} 
msa45901 . 2{225_2603 } 
msa45901.2{225_A909} 
msa45901 . 2 {225_CJB110 } 
msa4 5901.2(225 _COH 1 } 
msa45901.2{225_H36B} 
msa45901.2{225_JM9130013} 
msa45901 . 2 { 225_M732 } 
msa45901 . 2 {225_M78l} 
Consensus 



201 

L I DEVYANYY 
LIDEVYANYY 
L I DEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 



LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 



YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 



********** ********** ********** 



NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
NFWGARKVD 
********** 



250 

RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
********** 



msa45901 . 2 { 225_090 } 
msa45901.2(225_1169NTj 
msa45901.2{225_18RS2l} 
msa45901.2{225_2603} 
msa45901 . 2 {225__A909 } 
msa45901-2{225_CJBHOj 
msa45901 . 2 {225_COHl } 
msa45901 . 2 {225_H36B} 
msa45901 . 2 { 225_JM9130013 } 
msa45901.2{225_M732} 
msa45901.2{225_M78l} 
Consensus 



251 

FKQLHNKGkF 
FKQLHNKGkF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 



QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 



********_* ********** 



276 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
****** 
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Table 55: Comparative Sequences relating to SAG1592 



SEQ ID NO. 5501 
STRAIN 2603 

ATGCTTAAATCTTTTTTGATTTTCTTAGTTCG 

TT CC CAG CT AG CTGT CGTT AT CGTC CAACTTGCT CTACGTATATG AT AG AAG CTATT CAA 
AAACATG GTCTAAAAGGTGTGTTG ATGGGGATTG CACGT ATTTTG CGATGT CAT C CCTT A 
GCCCACGGAGGAAATGATCCTGTCCCTGATCATTTTAGCTTAAGACGTAATAAAACGGAT 

ATATCAGAT 

SEQ ID NO. 5502 

STRAIN 090 

TTCCC^GCTAGCTGTCGTTATCGTCCAACITGCTCTACGTATATGATAGA 
AG CTATT CAAAAACATGGTCTAAAAGGTGTGTTG ATGGGGATTG CACGT A 
TTTTGCX3ATGTCATCCCTITAGCCCA.CGGAGGAAATGATCCTGTCCCTGAT 
CATTTTAGCTT 

SEQ ID NO. 5503 
STRAIN A909 

TT CC CAG CT AG CTGT CGTT ATC GT CCAAC t TG CT CTACGTATATGATAGA 
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA 
TTTTGCGATGTCATCCCTTAGCCCACX3GAGGAAATGATCCTGTCCCTGAT 
CATTTTAgCTTAAGACGTAATAAAACGGATATA 

SEQ ID NO. 5504 

STRAIN H36B 

TT CCCAG CT AGCTGT CGTTAT CGT C CaACTTG CT CTACGTATATGATAGA 
AGCTATT CAAAAACATGGT CTAAAAGGTGTTCTGATGGGGATTG CACGT A 
TTTTGCGAT GTCAT C C CTTAGC CGACGGAGGAAATGAT CCTGT C C CTGAT 
CATTTTAG CTTAAG ACGTAATAAAACGGATATAT CAGAT 

SEQ ID NO. 5505 

STRAIN 18RS21 

TTCCCAG CT AGCTGT CGTTATCGT CCAACTTG CTCTACGTATATGATAGA 
AG CTATT CAAAAACATGGT CTAAAAGGTGTGTTGATGGGGATTG CACGTA 
TTTTG CGAT GT CAT C C CTT AG C CCACGGAGGAAATGAT C CTGTC C CTGAT 
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5506 

STRAIN M732 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA 
AG CTATT CAA?VAACATGGTCTAAAAGGTGTGTTGATGGGGATTG CACGTA 
TTTTGCGATGTCATCCCTTAgCCCACGC^GGAAATGATCCTGTCCCTGAT 
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5507 

STRAIN COH1 

TT CC CAG CT AG CTGT CGTTATCGT CCAACTTG CT CTACGTATATG ATAG AAG CTATT CAA 
AAACATGGT CTTAAAAGGTGTGTTGATGGGGATTG CACGTATTTTG CGATGT CATC CCTTA 
GCCCACGGAGGAAATGAtCCTGtCCCTGATCATTTTAGCT 

SEQ ID NO. 5508 

STRAIN M781 

TTCCCAG CTAGCTGTCGTTATCGTCCAACTTG CTCTACGTATATGATAGA 
AG CTATT CAAAAACATGGT CrTAAAAGGTGTGTTGATGGGGATTG CACGTA 
TTTTG CGATGTCAT C C CTTAG CCCACGGAGGAAATGATCCTGTC CCTGAT 
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5509 

STRAIN CJB110 

TTC C CAG CT AGCTGT CGTTATCGTC CAACTTG CT CTACGTATATGATAGA 
AG CTATT CAAAAACATGGT CTAAAAGGTGTGTTGATGCXK5 ATTG CACGTA 
TTTTGCX3ATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTC 
CATTTTAG CTTAAGACGTAATAAAACGGATAT AT CAGAT 

SEQ ID NO. 5510 

STRAIN 1169NT 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTG CTCTACGTATATGATAGA 
AG CTATT CAAAAACATGGT CTAAAAGGTGTGGTGATGGGG ATTG CACGTA 
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAA 
TATTTTAGCITAAGACGTAATAAAACGGATATAT CAGAT 

SEQ ID NO. 5511 

STRAIN JM9130013 

TTCC CAG CT AGCTGT CGTTAT CGTC CAACTTG CTCTACGTATATGATAGA 
AG CTATT CAAAAACATGGT CTAAAAGGTGTT CTGATGGGGATTG CACGTA 
TTTTGCGATGTC^TCCCTT'AGCCCACGGAGGAAATGATCCTGTCCCT 
CATTTTAG CTTAAGACGTAATAAAACGGATAT AT CAGAT 



PRETTY Of : /biotmp/msall9306 . 2 { * } April 29, 2003 06:23 .. 

1 50 

msall9306 . 2 {233_H36B} ~- 

.msalX9306.2{233_JM9130013} 
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Table 55: Comparative Sequences relating to SAG1592 



msall9306 
msall9306 .2{ 

msall9306 

msall9306 . 
msall9306 .2{ 

msall9306 . 

rasall9306 . 

msall9306 . 
msall9306 .2{ 



msall9306 
msall9306.2{233 
msall9306 
msall9306 ,2{ 

msall9306 . 

msa!19306 . 
msall9306 .2{ 

msall9306 . 

msall9306 . 

msall9306 . 
msall9306 .2{ 



• 2{233_090} 

233 1 8RS2 1 ) — — — — ^ r — ~ — — — 

2{233_2603} atgcttaaat cttttttgat tttcttagtt cgcttttacc aaaaaaatat 

2{233_A909} 

233_CJB110} 

2{233_COHl) 

2{233_M732} ~" ~ 

2{233_M78l} ~ 

233 1169NT} 

Consensus ********** ********** ********** ********** ********** 

51 100 

2{233 H36B} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

JM9130013) TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

72(233 090} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

233 18RS21} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

2 {233 2603} ttctccagct TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

2{233~A909) TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

233 CJB110} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

2 {233 COHl} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

2{233~M732} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

2{233~M78l} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

233 1169NT} TTCCCAGCTA GCTGTCGTTA TCGTCCAACT TGCTCTACGT 

Consensus ********** ********** ********** ********** ********** 



msall9306 . 
msall9306.2{233 
msa!19306 
msall9306.2{ 

msall9306 . 

msal!9306 . 
msall9306.2{ 

msall9306 . 

msall9306. 

msall9306. 
msall9306.2{ 



2{233_H36B} 
JM9130013> 
'2{233_090) 
233_18RS2l} 
2{233_2603} 
2{233_A909} 
233_CJB110} 
2{233_COHl} 
2{233_M732} 
2{233_M781} 
233_1169NT} 
Consensus 



101 

ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 



AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 



********** ********** 



AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
********** 



TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
********** 



150 

tcTGATGGGG 
tcTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
gtTGATGGGG 
ggTGATGGGG 
_******** 



151 

msall9306.2{233_H36B} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msa!19306.2{233_JM9130013} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233_090} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall93 06 .2{233_18RS2l) ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233_2603} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msa!19306.2{233_A909} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233_CJB110} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233_COHl} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233__M732} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306 .2{233_M78l} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

msall9306.2{233_1169NT} ATTGCACGTA TTTTGCGATG TCATCCCTTA 

Consensus ********** ********** ********** 



GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
GCCCACGGAG 
********** 



200 

GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
GAAATGATCC 
********** 



msall9306 . 
msa!19306.2{233 
msa!19306 

msall9306.2{ 
msall9306 
rasa!19306 

msall9306.2{ 
msall9306. 
msa!19306 
msall9306 

msall9306.2{ 



201 249 

2{233_H36B) TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

JM9130013} TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

2{233_090} TGTCCCTGAT cATTTTAGCT t " 

233_18RS21} TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

2{233_2603) TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

2{233 — A909} TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat ata 

233_CJB110} TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

2{233_COHl) TGTCCCTGAT cATTTTAGCT 

2{233_M732) TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

2{233_M781) TGTCCCTGAT cATTTTAGCT taagacgtaa taaaacggat atatcagat 

233__1169NT} TGTCCCTGAT tATTTTAGCT taagacgtaa taaaacggat atatcagat 

Consensus ********** _********* 



SKQ ID NO. 5512 

STRAIN 2603 frame: 1 

MLKSFLI FLVRFYQKNI SPAFPASCRYRPTCSTYMI EAI QKHGLKGVLMGI ARI LRCHPL 
AHGGNDPVPDHFSLRRNKTDISD 

SEQ ID NO. 5513 

STRAIN 090 frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGVLMG I ARILRCHPLAHGGNDPVPDHFS 

SEQ ID NO. 5514 

STRAIN A909 frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGVLMG I ARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
I 

SEQ ID NO. 5515 

STRAIN H36B frame: 1 



889 



WO 2004/018646 



PCT/US2003/026827 



Table 55: Comparative Sequences relating to SAG1592 



FPASCRYRPTCSTYMIEAI QKHGLKGVLMG I ARI LRCHPLAHGGNDP VPDHFSLRRNKTD 
ISD 

SEQ ID NO. 5516 
STRAIN 18RS21 frame: 1 

FPASCRYRPTCSTYMIEAI QKHGLKGVLMG IARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
ISD 

SEQ XD NO. 5517 

STRAIN M732 frame: 1 

FPASCRYRPTCSTYMIEAI QKHGLKGVLMG IARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
ISD 

SEQ XD NO. 5518 

STRAIN COH1 frame: 1 

FPASCRYRPTCSTYMIEAI QKHGLKGVLMG I ARI LRCHPLAHGGNDP VPDHFS 

SEQ XD NO. 5519 

STRAIN M781 frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGVLMG I ARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
ISD 



SEQ XD NO. 5520 
STRAIN CJB110 frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGVLMGI ARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
ISD 

SEQ XD NO. 5521 

STRAIN 1169NT frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGWMGI ARI LRCHPLAHGGNDPVPDYFSLRRNKTD 
ISD 

SEQ XD NO. 5522 

STRAIN JM9130013 frame: 1 

FPASCRYRPTCSTYMI EAI QKHGLKGVLMG I ARI LRCHPLAHGGNDPVPDHFSLRRNKTD 
ISD 

PRETTY of : /biotmp/msal!9415 . 2 { * } April 29, 2003 06:25 .. 

1 

msall9415.2{233_090} FPAS CRYRPT 

msall9415.2{233_18RS2l} FPASCRYRPT 

msall9415.2{233_COHl} FPASCRYRPT 

msall9415.2{233_A909} FPASCRYRPT 

msall9415.2{233_2603} mlksfliflv rfyqknispa FPASCRYRPT 

msall9415 . 2 {233_CJB110 } FPASCRYRPT 

msall9415.2{233_H36B} FPASCRYRPT 

msall9415.2{233_JM9130013} FPASCRYRPT 

msall9415.2(233_M732j FPASCRYRPT 

msall9415.2{233_M78l} FPASCRYRPT 

msall9415 . 2 {233_1169NT} FPASCRYRPT 

Consensus ********** ********** ********** 



CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
********** 



50 

KHGLKGV1MG 
KHGLKGV1MG 
KHGLKGV1MG 
KHGLKGV1MG 
KHGLKGVlMG 
KHGLKGV1MG 
KHGLKGVlMG 
KHGLKGVlMG 
KHGLKGVlMG 
KHGLKGVlMG 
KHGLKGVvMG 
*******_** 



msall9415 
msall9415 .2 { 

msall9415. 

msall9415. 

msa!19415 . 
msall94l5.2{ 

tnsall9415 
msall9415.2{233 

msall9415. 

msall9415. 
msall9415.2{ 



2{233_090} 
233_18RS21} 
2{233_C0H1) 
2<233_A909) 
2{233_2603} 
233_CJB110} 
2{233_H36B} 
_JM9130013} 
2f 233JM732} 
2{233_M781} 
233_1169NT} 
Consensus 



51 

IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
IARILRCHPL 
********** 



83 



AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 
AHGGNDPVPD 



hFS 

hFSLRRNKTD 

hFS 

hFSLRRNKTD 
hFSLRRNKTD 
hFSLRRNKTD 
hFSLRRNKTD 
hFSLRRNKTD 
hFSLRRNKTD 
hFSLRRNKTD 
yFSLRRNKTD 



ISD 



********** _********* 



I~~ 
ISD 
ISD 
ISD 
ISD 
ISD 
ISD 
ISD 
*** 
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SEQ ID NO. 5601 
STRAIN 2603 

aagaagcttacttttatttgggatttagatgggacattaatagattcgta 
tgtaccaattatggaagctcttgaagaaacctatcgtcattttggtttaa 
tatttgataaagaattaatccatgaatatattttacaggaatcagtgggg 
aaattattggtaaacctttcagaggaagagcaaatacctcatgaaaaact 
gaaagcatattttacaaaagaacaagaaagtcgagattctaaaatacatt 
taatgccatatgcaaaagagattttagaatggaccaaagaacaagatatc 
cccaattttatgtatacacataaaggagcaagtacgcattcagtgttgga 
aaccttgcagatctctcattattttgatgaaattttaactggtgtttcgg 
gattcgagcgaaaaccacatccacaagggattaattatttagttaaacga 
tattctttagataaatcaatgacttattacataggagatcgtccactaga 
tttggaggttgctcaaaatgctggtataaaatccataaacttaaggttag 
agaattccaaagaaaactataatatttcaagtctcaaagatataatatca 
ct tgatt tcactcgt ttggat 

SEQ ID NO. 5602 
STRAIN COH1 

AAGAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAA 
TAGATTCGTATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCAT 
TTTGGCTTAATATTTGAT AAAGAATTAAT C CATG AATATATTTT ACAGGA 
AT CAGTGGGG GAATTATTGGTAAACCTTT CAGAGG AAGAG CAAATACCT C 
ATGAAAAACTGAAAG CATATTTTACAAAAGAACAAGAAAGT CGAGATTCT 
AAAATACATTTAATG CCATATG CAAAAGAGATTTTAGAATGGAC CAAAGA 
ACAAGATATTCCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATT 
CAGTGTTGGAAACCTI^re CAGATCT CTCATT ATTTTGA 
GGTGTTTCGGGATT CGAG CGAAAAC CACATCCACAAGGGATTAATTATTT 
AGTTAAACX3ATATTCTTTAGATAAATGAATGACITATTACATAGGAGATC 
GTCCACT AGATTTGGAGGTTG CTCAAAATG CTGGT AT AAAATCCATAAAC 
TTAAGGTTAGAGAATTC CAAAGAAAACTATAATATTT CAAGTCTCAAAG A 
TATAATAT CACrTGATTTCACT CGTTTGGAT 

SEQ ID NO. 5603 
STRAIN A909 

AAGAAG CTTACTTTTATTTGGGATTTAGATGGGACATTAAT 

AGATT CGTATGTAC CAATTATGGAAGCTCTTG AAGAAACCT^ 

ATTTGATAAAGAATTAAT C CATGAATATATTTTACAGGAATCAGTGGGGAAATT ATTGGT 
AAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACTGAAAG CATATTTTACAAAAGA 
ACAAGAAAGT CGAGATT CTAAAATAG^TTT AATG CCATATG CAA^GAG ATTTTAGAATG 
GACCAAAGAACAAGATATC C CCAATTTTATGTATACACAT AAAGG AG CAAGTACG CATT C 
AGTGTTGGAAAC CTTTG CAGATCTCTCATTATTTTG ATGAAATT^ 

ATTCGAG CGAAAAC CACAT C CAC!AAGGGATTAATT ATTTAGTTAAACGATATTCTTT AG A 
TAAATCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGGAGGTTGCTCAAAATGC 
TGGTATAAAAT CCAT AAACTTAAGGTT AGAGAATT CCAAAGAAAACT ATAATATTT CAAG 
T CTCAAAGATATAATAT CACITGATTTCACTCGT 

SEQ ID NO. 5604 
STRAIN H36B 

AAGAAGCTTACITTTATTTGGGATTTAGATGGGACATTAATAGATTCG 

TATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGTTTAATATTT 

AAAGAATTAAT CCATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAAC CTT 

TCAGAGGAAGAGCAAATACCTCATGAAAAACTGAAAGCATATTTT 

AGTCGAGATTCTAAAATACATTTAA1X3CCATATGCAAAAGAGATTTT 

GAACAAGATATCCCCAATTTTATCT 

GAAACCTTGCAGATCrcrCATTATTTTGATGAAATTTTAACTGGTGT^ 
CGAAAAC CACAT CCACAAGGGATTAATTATTTAGTTAAACGATATT CTTTAGATAAATCA 
ATGACTTATTACAT AGG AG ATCGT C CACTAGATTTGGAGGTTGCT CAAAATGCTGGTATA 
AAAT C CAT AAACTT AAGGTTAGAG AATT C CAAAGAAAACTATAATATTT CAAGT CT CAAA 
GATATAATAT CACTTGATTT CACT CGTTTGGAT 

SEQ ID NO. 5605 
STRAIN 18RS21 

AAGAAG CTT ACTTTT ATTTGGG ATTTAGATGGGACATTAATAGA^ 
CGTATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGTTTAATATT^ 
ATAAAGAATT AATC CATGAATATATTTTACAGGAAT CAGTGGGG AAATTATTGGTAAAC C 
TTTCAGAGGAAGAG CAAATACCTCATG AAAAACTGAAAG CAT ATTTTACAAAAG AACAAG 
AAAGT CGAGATT CTAAAATACATTTAATG C CATATG CAAAAG AGATTTT AGAATGGACCA 
AAGAACAAG ATATC C CCAATTTTATGTATACACATAAAGGAG CAAGTACG CATTCAGTGT 
TGGAAACCTTGCAGATCTCTCATTATTTTGATGAAAT^ 

AG CGAAAAC CACATC CACAAC4GGATTAATT ATTTAGTTAAACGAT ATT CTTTAGATAAAT 
CAATGACTT ATTACATAGGAGAT CGT C CACTAGATTTGGAGGTTGCT CAAAATG CTGGT A 
TAAAAT C CATAAACTTAAGGTTAG AGAATT CCAAAGAAAACTAT AATATTTCAAGTOT 
AAGATATAATATCACTTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5606 
STRAIN M732 

AAGAAGCITACTTTTATTTGGGATTTAGATG^ 

TCGTATGTAC CAATT ATGGAAG CT CTTGAAGAAAC CT ATCGT CATTTTGG CTTAATATTT 
GATAAAGAATTAATCCATGAATATATTTTACAGGAATC^^ 

CTTT CAGAGG AAGAG CAAATAC CT CATGAAAAACTGAAAG CATATTTTACAAAAGAACAA 
GAAAGTCGAGATT CTAAAAT ACATTTAATG C CAT ATGCAAAAGAGATTTTAG AATGGAC C 
AAAGAACAAGATATTCCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATTC^ 
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TTC^AAACCTTGCAGATCTCTCATT^TTTTGATGAAATTTTAACTGGTGTTTCGGGATTC 
GAGCGAAAACCACATCCACAAGGGATTAATTATTTAGTTAAACGATA'TTCTTTAGATAAA 
TCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGGAGGTTGCTCAAAATGCTGGT 
AT AAAAT C CATAAACTT AAGGTTAG AG A^TT C CAAAG AAAAC TATAAT AT TTCAAGT CT C 
AAAGATATAATATCACTTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5607 
STRAIN CJB 110 

AAGAAG CTTACTTTTATTTGGG ATTTAGAT GG GACATT 

AATAGATTCGTATGT AC CAATT ATGGAAG CT CTTGAAGAAAC CTATCGT CATTTTGG CTT 
AATATTTGATAAAG AATTAAT C CATG AATATATTTTACAG GAAT CAGTGGGG CAATTATT 
GGTAAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACTGAAAGC^^ 
AGAACAAGAAAGTCGAGATTCTAAAATACATTTAATGCCATATGC 

. ATGG ACCAAA3AAGAAG ATAT C CC CAATTTTATGTAT ACACATAAAGGAG CAAGTACG CA 
TTCAGTGTTGGAAACCTTGCAGATCTCTCATTATTTTG^ 

TGGATTCGAG CG AAAACCA CAT CCACAAG GGATTAATT ATTTAGTT^ T 
AGATAAAT CAATG ACTT ATT ACATAGGAG AT CGT CCC CT AGATTTGGAG GTTGCTCAAAA 
TG CTGGTATAAAAT C CAT AAACTT AAGGTTAGAG AATTC CAAAGAAAACTATA^TATTT C 
AAGT CTC^GGATATAATAT CACTTGATTT CACT CGTT 

SEQ ID NO. 5608 
STRAIN 1I69NT 

aAGAAGCTTACTTTTATTTGGGATTTAGATGGGACATT^ 

TAGAAGCT CTTGAAGAAAC CTATCGTCATTTTGG CTT AATATTTGATAAAGAATTAATC 
ATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAAC CTTT CAGAGGAAG AG C 
AAATAC CTCATG AAAAACTG AAAG CATATTTTACAAAAGAACAAG AAAGT CGAGATT CTA 
AAATACATTTAATG C CATACG CAAAAGAGATTTTAGAATGGACCAAAGAACAAGATATC C 
CCAATTTTATGTATACACAT AAAGGAG CAAGTACG CATT CAGTGTTGGAAACCTTG CAGA 
TCTCT CATTATTTTGATGAAATTTT AA.CTGGTGTTTCGGGATT CGAG CGAAAACCACAT C 
CACAAGGGATTAATTATTTAGTTAAACGATATTCTTTAGATA^ 

TAGGAGATCGT CCC CTAGATTTGG AGGTTG CT CAAAATGCTGGTATAAAATCCATAAACT 

TAAGGTTAGAGAATTCCAAAGAAAACTATAATATTTCAAGTCTCAAGGATATAATATC^ 

TTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5609 
STRAIN JM9130013 

AAGAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAATAGA 

TTCGTATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGT^ 

TGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAA 

CCTTT CAGAGGAAGAG CAAATAC CT CATGAAAAACTGAAAG CATATTTTACAAAAGAACA 

AGAAAGTCGAGATTCTAAAATACATTTAATGCCATATGCAAAAGAGATT^ 

CAAAG AACAAGATAT CCC CAATTTTATGTATACACATAAAGGAGCAAGT ACG CATTCAGT 

GTTGGAAAC CTTG CAGATCT CT CATTATTTTGATGAAATTTTAACTGGTGTTTCGGGATT 

CGAG CGAAAAC CACATCCACAAGGG ATTAATTATTTAGTTAAACGATATT CTTT AGATAA 

AT CAATGACTTATTACATAGGAGAT CGTC CACTAGATTTGGAG^ 

TATAAAAT CCATAAACTTAAGGTT AGA.GAATT C CAAAGAAAACT ATAATATTTCAAGTCT 
CAAAGATATAAT AT CACTTGATTT CACTCGT 

SEQ ID NO. 5610 

STRAIN 690 

AAGAAGCTTACTTTTATTTGG 

GATTTAGATGGGACATTAATAGATTCGTATGTACCAATTATGGAAGCTCT 
TGAAGAAAC CTAT CGTCATTTTGG C1TAATATTTG ATAAAGAATT AATC C 
ATGAATATATTTTACAGGAATCAGTGGGG CAATTATTGGTAAACCTTTCA 
GAGGAAGAGCAAATACCTCATGAAAAACTGAAAGCATATTTTACAA 
ACAAGAAAGT CGAGATT CTAAAATACATTTAATGCCATATGCAAAAGAG A 
TTTTAGAATGGACCAAAGAACAAGATATCCCCAATTTTATGTATACACAT 
AAAGGAG CAAGTACG CATT CAGTGTTGGAAAC CTTGCAGATCTCT CATTA 
TTTTGATGAAATTTTAACTGGTGTTTCTGGATTCG AG CGAAAAC CACATC 
CACAAGGGATTAATTATTTAGTTAAACGATATTCTTTAGATAAAT CAATG 
ACTTATT ACATAGGAGATCGTCCCCTAGATTTGGAGGTTG CT CAAAATG C 
TGGTATAAAATCCATAAACTTAAGGTTAGAGAATTCCAAAGAAAACTATA 
AT ATTTCAAGT OTCAAGGATATAAT AT CACTTGATTT CACTCGT 

SEQ ID NO. 5611 
STRAIN M781 

AAGAAGCTT ACTITTTATTT GGG ATTTAGATGGGACATTAATAGATTCGT 
ATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTT 
ATATTTGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGG 
GCAATTATTGGTAAACCITTCAG^GGAAGAGCAAATACCT 
TGAAAGCATATTTTACAAAAGAACAAGAAAGTCXIAGATT^ 
TTAATGC CATATG CAAAAGAGATTTTAGAATGGACCAAAGAACAAGATAT 
T CCCAATTTTATGTATACACAT AAAGGAG CAAGTACG CATTCAGTGTTGG 
AAACCTTG CAGATCT CTCATTATTTTG AT GAAATTTTAACTGGTGTTTCG 
GGATTCGAG CGAAAACCACAT C CACAAGGGATTAATTATTTAGTTAAACG 
ATATT CTTTAGATAAAT CAATGACTTATT ACATAGGAGAT CGTC CACTAG 
ATTTGGAGGTTG CT CAAAATGCTGGTATAAAATC CAT AAACTTAAGGTTA 
GAGAATTC CAAAGAAAACTATAATATTT CAAGTCT CAAAGATATAATAT C 
ACTTGATTT CACTCGT 

PRETTY of : /biottnp/msa45163 . 2 { *} January 21, 2003 06:53 
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msa45163.2{240_18RS2l} 
msa45163 .2{240_2603} 
msa45163 . 2 (240_A909 } 
msa4 5163.2(24 0_H3 6B ) 
msa45163 . 2 {240_JM9130013 } 
msa45163 . 2 { 240_COH1 J 
msa45163 . 2 { 240_M732 } 
msa45163.2(240_M78l} 
msa45163 . 2 { 240_090 } 
msa45163 . 2 { 240_CJB110 } 
msa45163 . 2 { 240_1169NT } 
Consensus 



msa45163.2(240_18RS21 
msa45163 . 2 (240_2603 
msa45163 . 2 (240_A909 
msa4 5163.2(24 0 JH3 6B 
msa45163.2{240_JM9130013 
msa4 5163. 2(24 0_COH1 
msa45163.2(240_M732 
msa45163.2(240_M781 
msa45163 .2{240_090 
msa45163.2{240_CJB110, 
msa45163.2(240_1169NT • 
Consensus 



AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 



CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 
CTTTTATTTG 



GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 



********** ********** ********** 
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TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 



ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATaGAAGCTC 



********** **_******* 



TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
********** 
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TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
********** 
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TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
********** *****_**** 



GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
********** 



CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 



101 150 

msa45163.2(240_18RS2l} TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163.2{240_2603} TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163 . 2 (240_A909 } TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163 . 2{240_H36B} TATTTGATAA AGAATTAATC CATGAATATA TTTTA CAGGA ATCAGTGGGG 

msa45163.2{240_JM9130013} TATTTGATAA AGAATTAATC CATGAATATA TTTT ACAGGA ATCAGTGGGG 

msa45163 .2(240_COH1} TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163.2{240_M732} TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa4 5163. 2(24 0_M7 8 1 } TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163 .2(240_090} TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

ms a4 5163. 2(24 0_CJB1 1 0 } TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

msa45163 .2(240 1169NT) TATTTGATAA AGAATTAATC CATGAATATA TTTTACAGGA ATCAGTGGGG 

Consensus ********** ********** ********** ********** ********** 



msa45163.2(240_18RS2l} 
msa45163. 2 { 240^2603} 
msa45163.2{240_A909} 
msa45163.2{240_H36B) 
msa45163 . 2 (240_JM9130013 ] 
msa45163.2l240_C0Hl) 
msa45163.2{240_M732] 
msa45163.2(240_M781 
msa45163 .2(240_090] 
msa45163 . 2 (240_CJB110 ] 
msa45163.2(240_1169NT) 
Consensus 



151 

aAATTATTGG 
aAATTATTGG 
aAATTATTGG 
aAATTATTGG 
aAATTATTGG 
cAATTATTGG 
cAATTATTGG 
CAATTATTGG 
CAATTATTGG 
cAATTATTGG 
aAATTATTGG 
_********* 



TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
TAAACCTTTC 
********** 



AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
AGAGGAAGAG 
********** 



CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
CAAATACCTC 
********** 



201 

msa45163.2{240_18RS2l} GAAAGCATAT TTTACAAAAG 

msa45163.2f240_2603) GAAAGCATAT TTTACAAAAG 

msa45163.2{240_A909} GAAAGCATAT TTTACAAAAG 

msa45163 .2{240_H36B} GAAAGCATAT TTTACAAAAG 

msa45163. 2 (240_JM9130013 } GAAAGCATAT TTTACAAAAG 

msa45163.2{240_COHlJ GAAAGCATAT TTTACAAAAG 

msa45163.2{240_M732 J GAAAGCATAT TTTACAAAAG 

msa45163.2(240_M78lJ GAAAGCATAT TTTACAAAAG 

msa45163 .2 { 240^090} GAAAGCATAT TTTACAAAAG 

msa45163.2{240_CJB110} ' GAAAGCATAT TTTACAAAAG 

msa45163.2(240_1169NT} GAAAGCATAT TTTACAAAAG 

Consensus ********** ********** 



AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 
AACAAGAAAG 



TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTyT 
TCGAGATTcT 
TCGAGATTcT 
TCGAGATTcT 



********** ********-* 



200 

ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
ATGAAAAACT 
********** 

250 

AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
AAAATACATT 
********** 



msa45163.2{ 

msa45163 . 

msa45163. 

msa45163 . 
msa45163.2{240 

msa45163 . 

msa45163 . 

msa45163 
msa45163 
msa45163.2f 
msa45163.2{ 



240_18RS2l] 
2{240_2603; 
2{240_A909l 
2(240_H36B] 
JM9130013; 
2(240_COH1] 
2(240_M732 
2(240_M781] 
2{240__090] 
240_CJB110] 
240_1169NT] 
Consensus 



251 

TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
TAATGCCATA 
********** 



tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
tGCAAAAGAG 
cGCAAAAGAG 
_********* 



ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
ATTTTAGAAT 
********** 



GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 
GGACCAAAGA 



300 

ACAAGATATc 
ACAAGATATc 
ACAAGATATc 
ACAAGATATc 
ACAAGATATc 
ACAAGATATt 
ACAAGATATt 
ACAAGATATt 
ACAAGATATc 
ACAAGATATc 
ACAAGATATc 



********** *********_ 
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msa45163.2{240_18RS2l) 
msa45163 . 2{240_2603 } 
msa45163.2{240_A909} 
msa45163 . 2 {240_H36B} 
msa45163 . 2 { 240_JM9130013 } 
msa4 5 1 6 3 . 2 { 2 4 0_COH1 } 
msa4 5163 .2(24 0_M73 2 ) 
msa4 5 1 63 . 2 { 2 4 0_M78 1 } 
msa45163 .2{240__090} 
msa45163 . 2{240__CJB110} 
msa45163 .2{240_1169NT} 
Consensus 



301 

CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
CCCAATTTTA 
********** 



TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
TGTATACACA 
********** 



TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
TAAAGGAGCA 
********** 



AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
AGTACGCATT 
********** 



350 

CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
********** 



351 

msa45163.2{240_18RS2l} AACCTTGCAG 

msa45163 .2{240_2603} AACCTTGCAG 

ms a4 5 1 6 3 . 2 { 2 4 0_A9 0 9 } AACCTTGCAG 

msa45163.2{240_H36Bj AACCTTGCAG 

msa45163 .2{240_JM913 0013) AACCTTGCAG 

msa4 5163 .2(24 0_COHl} AACCTTGCAG 

msa45163.2{240_M732} AACCTTGCAG 

msa45163 .2{240_M781) AACCTTGCAG 

msa45163. 2 { 240_090) AACCTTGCAG 

msa45163 . 2{240__CJB110) AACCTTGCAG 

msa45163 .2{240_1169NT} AACCTTGCAG 

Consensus ********** 



ATCTCTCATT 
AT CTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 
ATCTCTCATT 



ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 
ATTTTGATGA 



********** ********** 



AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
AATTTTAACT 
********** 



400 

GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCgG 
GGTGTTTCtG 
GGTGTTTCtG 
GGTGTTTCgG 
********_* 



401 450 

msa45163.2{240_18RS21) GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163.2f 240_2603) GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163.2{240_A909} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163 .2{240_H36B} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163.2{240__JM9130013} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163.2{240_COHl} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163-2{240_M732} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa451 63. 2 { 24 0_M78l} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163. 2 {240^090} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163 .2{240_CJB110} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

msa45163.2{240_1169NT} GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA 

Consensus ********** ********** ********** ********** ********** 



msa45163.2{ 
msa45163 . 
msa45163 - 
msa45163 . 
msa45163.2{240 
msa45163 . 
msa45163 . 
msa45163l 
msa45163 
msa45163.2{ 
msa45163 .2{ 



240_18RS21} 
2 {240^2603} 
2{240_A909"} 
2{240_H36B} 
_jJM9130013} 
2{240_COH1} 
2{240_M732} 
2{240_M781} 
2{240_090} 
240_CJB110} 
240_JL169NT} 
Consensus 



451 

TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
TATTCTTTAG 
********** 



ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
ATAAATCAAT 
********** 



GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
GACTTATTAC 
********** 



ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 
ATAGGAGATC 



500 

GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCaCTAGA 
GTCCcCTAGA 
GTCCcCTAGA 
GTCCcCTAGA 



********** ****_***** 



501 

msa45163.2{240_18RS2l} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45 163. 2 {240^2603} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

tnsa45163.2{240_A909} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163.2{240_H36B} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163.2{240_JM9130013 J TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163 .2{240_COH1} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163.2{240_M732} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

rnsa45163 .2{240_M781) TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa4 5163.2(24 0_0 9 0 } TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163.2{240_CJB110} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

msa45163.2{240_1169NT} TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC 

Consensus ********** ********** ********** ********** 



550 

TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
TTAAGGTTAG 
********** 



msa45163.2{ 

msa45163 . 

msa45163 . 

msa45163 . 
msa45163.2{240, 

msa45163 .' 

msa45163 . 

msa45163 . 
msa45163 
msa45163.2{ 
msa45163.2{ 



240_18RS21} 
2 f 240_2603\ 
2{240__A909) 
2{240_H36B} 
JM9130013} 
2{240_COHl} 
2(240_M732} 
2{240_M781} 
.2{240_090} 
240_CJB110) 
240_1169NT) 



551 

AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 
AGAATTCCAA 



AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 
AGAAAACTAT 



AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 
AATATTTCAA 



GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAaGA 
GTCTCAAgGA 
GTCTCAAgGA 
GTCTCAAgGA 



600 

TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
TATAATATCA 
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Consensus ********** ********** ********** *******_** ********** 

601 621 

msa45163 . 2 {240_18RS2l} CTTGATTTCA CTCGTttgga t 

msa45163 .2f 240_2603} CTTGATTTCA CTCGTttgga t 

msa45 163 .2(24 0_A909) CTTGATTTCA CTCGT 

msa45163.2{240__H36B} CTTGATTTCA CTCGTttgga t 

msa45163.2{240_JM9130013} CTTGATTTCA CTCGT 

msa45163.2{240_COHl} CTTGATTTCA CTCGTttgga t 

ms'a45163.2{240_M732} CTTGATTTCA CTCGTttgga t 

msa45163 -2{240_M78l} CTTGATTTCA CTCGT - 

msa45163.2{240_090} CTTGATTTCA CTCGT 

msa45163.2{240 CJB110} CTTGATTTCA CTCGT t- 



msa45163-2{240_1169NT) CTTGATTTCA CTCGTttgga t 
Consensus ********** *★***_- 

SEQ ZD NO. 5612 
STRAIN 2603 frame: 1 

KKLTP I WDLDGTL I D S YVP I MEALEET YRHFGL I FDKEli I HE YI LQE SVGKLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYL VKRYS LD KS MTYY I GDRPLDLiEVAQNAG IKS IN 
LRLENSKENYNISSLKDI I SLDFTRLD 

SEQ ID NO. 5613 

STRAIN A909 frame: 1 

KKLTF I WDLDGTL IDS YVP IMEALEETYRHFGLI FDKELI HE Y I LQE SVGKLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYLVKRYSLDKSMTYY I GDRPLDLEVAQNAG IKS IN 
LRLENSKENYNI SSLKD I ISLDFTR 

SEQ ID NO. 5614 

STRAIN H36B frame: 1 

KKLTF I WDLDGTLIDS YVP IMEALEETYRHFGLI FDKELIHEYI LQESVGKLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYLVKRYSLDKSMTYY I GDRPLDLEVAQNAG IKS IN 
LRLENSKENYNI SSLKD I I SLD FT RLD 

SEQ ID NO. 5615 

STRAIN 18RS21 frame: 1 

KKLTFIWDLDGTLI DS YVP IMEALEETYRHFGLI FDKELIHEYI LQESVGKLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYL VKRYS DDKS MTYY I GDRPLDLEVAQNAG IKSIN 
LRLENSKENYNI SSLKD I I SLDFTRLD 

SEQ ID NO. 5616 

STRAIN M732 frame: 1 

KKLTFI WDLDGTLIDS YVP IMEALEETYRHFGLI FDKELI HEY I LQESVGQLLVNLSEEE 
Q I PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYLVKRYSLDKSMTYY I GDRPLDLEVAQNAG IKS IN 
LRLENSKENYNI SSLKD 1 1 SLDFTRLD 

SEQ ID NO. 5617 

STRAIN COH1 frame: 1 

KKLTF I WDLDGTL I DSYVP IMEALEETYRHFGLI FDKELI HEY I LQESVGQLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYL VKRYS LDKSMTYY I GDRPLDLEVAQNAG IKSIN 
LRLENSKENYNI SSLKDI I SLDFTRLD 

SEQ ID NO. 5618 

STRAIN CJB110 frame: 1 

KKLTF I WDLDGTL I DSYVP I MEALEET YRHFGL I FDKEL I HEY I LQESVGQLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYLVKRYS LDKSMTYY I GDRPLDLEVAQNAG I KS I N 
LRLENSKENYNI SSLKD I ISLDFTR 

SEQ ID NO. 5619 

STRAIN 1169NT frame: 1 

KKLTF I WDLDGTL I DSYVP 1 1 EALEET YRHFGL I FDKEL I HE YI LQE SVGKLLVNLSEEE 
QIPHEKLKAYFTKEQESRDSKIHLMPYAKE ILEWTKEQDI PNFMYTHKGASTHSVLETLQ 
I SHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYY I GDRPLDLEVAQNAG I KS IN 
LRLENSKENYNI SSLKDI I SLDFTRLD 

SEQ ID NO. 5620 

STRAIN JM9130013 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLI FDKELIHEYI LQESVGKLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE ILEWTKEQDI PNFMYTHKGASTHSVLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYLVKRYS LDKSMTYY I GDRPLDLEVAQNAGI KS IN 
LRLENSKENYNI SSLKD I ISLDFTR 

SEQ ID NO. 5621 

STRAIN 090 frame: 1 

KKLTF I WDLDGTLI DSYVP IMEALEETYRHFGLI FDKEL I HE YI LQESVGQLLVNLSEEE 
QI PHEKLKAYFTKEQESRDSKI HLMPYAKE I LEWTKEQD I PNFMYTHKGASTHSVLETLQ 
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I SHYFDE ILTGVSGFERKPHPQG I NYLVKRYSLDKSMTYY IGDRPLDLEVAQNAG IKS IN 
LRLENSKENYNISSLKDI I SLDFTR 

SEQ ID NO. 5622 
STRAIN M781 frame: 1 

KKLTFI WDLDGTLI DS YVP I MEALEETYRHFGLI FDKEL IHEYI LQES VGQLLVNLSEEE 
QI PHEKLKAYFTKEQESRDXKI HIiMPYAKE ILEWTKEQD I PNFMYTHKGASTHS VLETLQ 
I SHYFDE I LTGVSGFERKPHPQG I NYL VKRYSLDKSMT Y Y I GDRPLDLE VAQNAG I KS IN 
LRLENSKENYNISSLKDI I SLDFTR 

PRETTY of: /biottnp/msa45645 . 2 { * } January 21, 2003 06:57 



msa45645.2{ 

msa45645 . 
msa45645.2{240 

msa45645. 

msa45645. 
msa45645 
msa45645.2{ 

msa45645 . 

msa45645 . 

msa4S645. 
msa45645.2{ 



240__18RS2l} 
2{240_A909} 
_JM9130013} 
2{240_2603} 
2{240_H36B} 
2{240_090} 
240_CJB110} 
2{240_M781} 
2{240_COH1} 
2{240_M732} 
240__1169NT} 
Consensus 



KKLTF I WDLD 
KKLTFIWDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
KKLTF I WDLD 
********** 



GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
GTLIDSYVPI 
********** 



mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
mEALEETYRH 
i EALEETYRH 



FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 
FGLIFDKELI 



50 

HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 
HEYILQESVG 



_********* ********** ********** 



msa45645.2{ 
msa45645 . 
msa45645.2{240 t 
msa45645 
msa45645. 
msa45645 

msa45645.2{ 
msa45645 
msa45645 
msa45645. 

msa45645 .2{ 



240_18RS2l} 
2{240_A909} 
_JM9130013} 
2{240_2603) 
2{240JH36B} 
.2{240_090} 
240_CJB110i 
2{240_M781> 
2{240_COH1> 
2{240_M732} 
240_1169NT} 
Consensus 



51 1 

kLLVNLSEEE 
kLLVNLSEEE 
kLLVNLSEEE 
kLLVNLSEEE 
kLLVNLSEEE 
qLLVNLSEEE 
qLLVNLSEEE 
qLLVNLSEEE 
qLLVNLSEEE 
qLLVNLSEEE 
kLLVNLSEEE. 
_********* 



QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
QIPHEKLKAY 
********** *********_ 



FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDs 
FTKEQESRDX 
FTKEQESRDs 



KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
KIHLMPYAKE 
********** 



100 

ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
ILEWTKEQD I 
********** 



101 

msa45645.2{240_18RS2l} PNFMYTHKGA 

msa45645.2{240_A909} PNFMYTHKGA 

msa45645.2{240_JM9130013} PNFMYTHKGA 

msa45645.2{240_2603} PNFMYTHKGA 

msa45645.2{240_H36B} PNFMYTHKGA 

msa45645 . 2 {240_090 } PNFMYTHKGA 

msa45645 . 2 { 240_CJB110 } PNFMYTHKGA 

msa45645.2{240_M78l} PNFMYTHKGA 

ms a4 5 64 5 . 2 { 2 4 0_COH1 } PNFMYTHKGA 

msa45645 - 2 { 240_M732 } PNFMYTHKGA 

msa45645.2{240_1169NT} PNFMYTHKGA 

Consensus ********** 



STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 
STHSVLETLQ 



ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 
ISHYFDEILT 



********** ********** 



GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
GVSGFERKPH 
********** 



150 

PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
PQGINYLVKR 
********** 



151 200 

msa45645.2{240_18RS2l} YSLDKSMTYY I GDRPLDLE V AQNAGI KS IN LRLENSKENY NISSLKDIIS 

msa45645.2{240_A909} YSLDKSMTYY I GDRPLDLE V AQNAGI KS IN LRLENSKENY NISSLKDIIS 

msa45645.2{240_JM9130013} YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_2603} YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_H36B} YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645 . 2 { 240_090 } YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_CJB110) YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_M78l} YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_COHl} YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_M732) YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

msa45645.2{240_1169NT) YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS 

Consensus ********** ********** ********** ********** ********** 



. 201 

msa45645.2{240_18RS2l} LDFTRld 

msa45645.2{240_A909) LDFTR — 

msa45645.2{240__JM9130013) LDFTR — 

msa45645 .2 (240__2603 } LDFTRld 

msa45645.2{240_H36B} LDFTRld 

msa45645. 2 {240^090} LDFTR — 

msa45645.2{240_CJB110} LDFTR— 

msa4 5645 .2(24 0_M78l) LDFTR — 

moa45645 .2(240_COH1} LDFTRld 

msa45645.2{240_M732} LDFTRld 

msa45645.2{240_1169NT} LDFTRld 

Consensus *****•-- 
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SEQ ID NO: 5701 
STRAIN 2603 

ATGCTTATGACAAAAATAATAGGACTGACAGGAGGGATAGCTTCT 

GGAAAGTCAACGGTAACAAAAATAATACGAGAATCAGGTTTTAAAGTCATAGATGCGGAT 
CAAGTGGTT CAT AAATTG CAAG CT AAGGGTGGGAAACTTTAC CAAG CTTTATTAG AATGG 
TTGGGTCCCGAGATACTTGATGCTGATGGTGAGTTGGATAGACCAAAGCTTTCTCAAATG 
ATTTTTGCTAATCCAGACAATATGAAGACATCAGCTAGGCTACAAAATAGTATCATTCGT 
CAAGAGTTAGCATGTCAGCGCGACCAATTAAAACAAACAGAAGAGATATTTTTCATGGAT 
ATTC CTTT ATTGATTG AAGAAAAGT ATATAAAATGGTTTG AT GAGATTTG GTTGGTATTT 
GTTGATAAAGAAAAACAATTAGAACGATTAATGGCCCGTAACAACTACAGTCGAGAAGAA 
G GAGAATTACGACTTTCACAC CAAATG CCTTTAACAG AT AAAAAAAGTTT CG CTAGT CTT 
ATTATTGACMTAATGGTGATTTAATAACTTTAAAAGAGCAAATATTGGATGCTCTTC^ 
CGTTTA 

SEQ ID NO: 5702 
STRAIN 090 

AAGTCAACGGTAACAAAAATAATACGAGAATCAG 

GTTTTAAAGTCATAGATGCGGATCAAGTGGTTCATAAATTGCAAGCTAAG 
GGTGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATACT 
TGATG CTGATGGTG AGTTGG AT AG ACCAAAG CTTT CT CAAATGATTTTTG 
CT AAT CCAGACAATATGAAGACAT CAG CTAGG CTACAAAATAGTATCATT 
CGTCAAGAGTTAGCATGTCAGCGCGACCAATTAAAACAAACAGAAGAGAT 
ATTTTTCGTGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAATGGT 
TTGATGAGATTTGGTTGGTATTTX3TTGATAAAG 

TTAATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACITTC 
ACAC CAAA.TG CCTTTAACAGAT AAAAAAAGTTTCG CTAGT CTTATTATTA 
AT AATAATGGTG ATTTAATAACTTTAAAAG AGCAAAT ATTGGAT G CT CTT 
CAACGTTTA 

SEQ ID NO: 5703 
STRAIN A909 

AAGTCAACGGTAACAAAAATAATACGAGAATCAG 

GTTTTAAAGT CATAGATGCGGATCAAGTGGTT CATAAATTG CAAG CT AAG 
GGTGGGAAACTTTACCAAGCTTTATTAGAATG GTTGGGTC CCGAGATACT 
TGATG CTGATGGTGAGTTGGATAGACCAAAG CTTT CT CAAATGATTTTTG 
CT AATCCAGACAATATGAAGACATCAG CTAGG CTACAAAATAGTATCATT 
CGTCAAGAGTTAG CATGTCAG CGCGACCAATT AAAACAAACAGAAGAGAT 
ATTTTTCATGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAATGGT 
TTGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGA 
TTAATGG C CCGTaACAACTACAGTCGAGAAGAAG CAGAATTACGACTTT C 
ACAC CAAATG CCTTTTAACAG ATAAAAAAAGTTTCG CTAGTCT 
AGAATAATGGTGATTTAAT AACTTT AAAAGAG CAAATATTGG ATGCT CTT 
CAACGTTTA 

SEQ ID NO: 5704 
STRAIN H36B 

AAGT CAACGGTAACAAAAAT AATACGAGAATCAGG 

TTTT AAAGT CATAGATGCGGAT CAAGTGGTTCAT AAATTG CAAG CTAAGG 
GTGGGAAACTTTAC CAAG CTTT ATTAGAATGGTTGGGTC C CGAGATACTT 
GATG CTGATGGTGAGTTGGATAGAC CAAAG CITT CTCAAATGATTTTTG C 
TAATCCAGACAATATGAAGACATCAGCTAGGCTACAAAATAGTATCATTC 
GTCAAGAGT TAG CATGTCAG CG CGACCAATTAAAACAAACAGAAGAGAT A 
TTTTTCATGGATATTCCTTTATTGATTGAAGAAAAGTATATAAA^ 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAAC^TTACAACGAT 
TAATGGC CCG t^AACAACTACAGT CGAGAAGAAGCGGAATTACGACTTT CA 
CACCAAATAC CTTT AACAG ATAAAAAAAGTTT CG CTAGT CTTATTATTGA 
TAAT AATGGTGATTTAATAACTTT AAAAGAGCAAATGTTGGATG CTCT^ 
AACGTTTA 

SEQ ID NO: 5705 
STRAIN 18RS21 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGG 

TTTTAAAGT CATAG ATG CGGAT CAAGTGGTTCAT AAATTG CAAG CTAAGG 
GTGGGAAACTTTACCAAGCTTTATTAGAA 

GATG CTGATGGTGAGTTGGATAGAC CAAAG CTTT CT CAAATGATTTTTGC 
TAATC CAGACAATATGAAGACAT CAGCTAGGCTACAAAATAGTAT CATTC 
GT CAAGAGTT AG CATGT CAG CG CGACCAATTAAAACAAACAG AAGAGATA 
TTTTTCATGGATATTCCTTTATTGATTG 

TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAAC^T 
TAATGGC CCGTAACAACTACAGT CGAGAAGAAG CAGAATTACGACTTT CA 
CACCAAATGCCTTTAACAGATAAAAAAAGTTTCGCTAGTCTTA^ 
CAAT AATGGTGATTTAATAACTTT AAAAGAG CAAATATTGGATG CTCTT C 
AACGTTTA 

SEQ ID NO: 5706 
STRAIN M732 

AAGT CAACGGTAACAAAAATAATACGAGAAT CAGGTT 

ttaaagtcatagatgcggatcaagtggttcataaattgcmgctaagggt 

GGGAAACTTTACCAAG CTTT ATT AGAATGGTTGGGT C CCGAGATACT TGA 

tgctgatggtgagttggatagaccaaagctttctcaaatgatttttgcta 

AT C CAGACAATATG AAG ACATCAG CTAGG CTACAAAATAGTATCATT CGT 
CAAGAGTTAGCATGTCAGCGCGACCAATTAAAACAAACAGAAGAGATATT 
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TTTCATGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAATGGTTTG 
ATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGATTA 
ATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCACA 
CCAAATGCCTTTAACAGATAAAAAAAGTTTCGCTAGTCTTATTATTGACA 
ATAATGGTG ATTT AATAACTTT AAAAG AG CAAAT ATTGGATG CT CTT CAA 
CGTTTA 

SEQ ID NO: 5707 
STRAIN COH1 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGGT 

TTTAAAGTCATAGATGCGGATCAAGTGGTTCATAAATTGCAAGCTAAGGG 
TGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATACTTG 
ATGCTGATGGTGAGTTGGATAGACCAAAGCTTTCTCAAATGATTTTTGCT 
AAT C CAG ACAAT ATG AAGACAT CAG CTAGG CTACAAAATAGT ATCATT CG 
TCAAGAGTTAGCATGTCAGCGCGACCAATTAAAACAAACAGAAGAGATAT 
TTTT CATGGATATT C CTTT ATT GATTGAAGAAAAGTATATAAAATGGTTT 
GATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTAC1AACGATT 
AATGG CC CGTaACAACTACAGT CGAGAAG AAG CAGAATTACG ACTTT CAC 
ACCAAATGCCTTTAACAGAT AAAAAAAGTTT CG CTAGTCTTATTATTGAC 
AAT AATGGTGATTT AAT AACTTTAAAAGAG CAAATATTGGATGCT CTT CA 
ACGTTTA 

SEQ ID NO: 5708 
STRAIN M781 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGG 

'riT^AAAGTC^TAGATGCGGATCAAGTGGTTCATAAATTGCAAGCTAAGG 
GTGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCX3AGATACTT 
GATG CTGATGGTGAGTTGGATAGAC CAAAG CTTT CTCAAATGATTTTTG C 
TAAT CCAGACAATATGAAGACATCAG CTAGG CTACAAAATAGTATCATTC 
GT CAAGAGTTAG CATGT CAGCG CG ACCAATTAAAACAAACAGAAG AGATA 
TTTTT CATGGATATT CCTTTATTGATTGAAGAAAAGTATATAAAATGGTT 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGAT 
TAATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCA 
CAC CAAATG C CTTT AACAGATAAAAAAAGTTT CG CTAGTCTTATTATTGA 
CAATAATGGTGATTTAATAACTTTAAAAG AG CAAATATTGGATG CT CTT C 
AACGTTTA 

SEQ ID NO: 5709 
STRAIN OB 110 

AAGTCAACGGTAACAAAAATAATACGAGAA 

TCAGGTTTTAAAGTCATAGATGCGGATCAAGTGGTTCATAAATTGCAAGC 
TAAGGGTGGG AAACTTTAC CAAG CTTT ATTAGAATGGTTGGGTC C CGAGA 
TACTTGATG CTGAT GGTGAGTTGG ATAGACCAAAG CTTT CT CAAATG ATT 
TTTGCTAATCCAGACAATATGAAGACATCAGCTAGGCTACAAAATAGTAT 
CATT CGT CAAGAGTTAG CATGT CAG CG CG AC CAATTAAAAQ\AACAGAAG 
AGATATTTTTCGTGGATATTCCITTATTGATTGAAGAAAAGTATATAAAA 
TGGTTTGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTAC^ 
ACGATTAATGGCCCGTaACAACTACAGTCGAGAAGAAGCAGAATTACGAC 
TTT CACACCAAATG CCTTTAACAGATAAAAAAAGTTTCGCTAGT CTTATT 
ATTAATAATAATGGTGATTTAATAACTTTAAAA.GAGCAAATATTGGATGC 
TCTTCAACGTTTA 

SEQ ID NO: 5710 
STRAIN 1169NT 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGG 

TTTTAAAGT CATAGATG CGGAT CAAGTGGTTCATAAATTG CAAGCTAAGG 
GTGGGAAACTTTACCAAGCITTATTAGAATGGTTGGGTCCCGAGATACTT 
GATGCTGATGGTGAGTTGGATAGACCAAAGCTTTCTCAAATGATTTTTGC 
TAAT C CAGACAATATGAAGACATCAGCTAGG CTACAAAA.T AGTATCATTC 
GT CAAGAGTTAGCATGT CAG CG CGACCAATTAAAACAAACAGAAG AGATA 
TTTTT CATGGATATTCCTTT ATTG ATTGAAGAAAAGTATATAAAATG GTT 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGAT 
TAATGGCC CGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCA 
CACCAAATAC CTTTAACAGATAAAAAAAGTTT CG CTAGT CTTATTATTG A 
TAATAATGGTGATTTAATAACTTTAAAAGAGCAAATGTTGGATGCTCTTC 
AACGTTTA 

SEQ ID NO: 5711 
STRAIN JM9130013 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGGT 

TTTAAAGTCATAGATGCGGATCAAGTGGTTCATAAATTGCAAGCTAAGGG 
TGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATACTTG 
ATGCTGATGGTG AGTTGGAT AG ACCAAAG CTTT CT CAAATGATTTTTG CT 
AAT CCAG ACAAT ATGAAGACAT CAG CTAGG CTACAAAATAGTATCATTCG 
TCAAG AGTTAGCATGT CIAG CG CGAC CAATT AAAACAAACAGAAGAGATAT 
TTTT CATGGATATT C (CTTTATTGATTG AAGAAAAGTATAT AAAATGGTTT 
GATGAGATTTGGTTGGTATOTGTTGATAAAGAAAAACAATTACAACGATT 
AATGG C C CGTAACAACTACAGT CGAGAAG AAG CGGAATTACGACTTT CAC 
AC CAAATAC CTTTAACAGATAAAAAAAGTTTCG CT 

AAT AATGGTGATTTAATAACTTTAAAAGAG CAAATGTTGGATG CT CTT CA 
ACGTTTA 
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PRETTY of: /biotmp/msa221059 . 2 { * } February 10, 2003 07:07 



msa221059 .2 { 245_H36B 
msa221059.2{245_JM9130013, 

msa221059 .2{245_1169NT} AA 



50 
AA 
AA 



msa221059.2{245_090 
msa221059.2{245_CJB110 
msa221059 . 2 { 245_18RS21 
maa221059.2{245_2603 
msa221059.2{245 > _A909 
msa221059 . 2 { 245_COHl 
msa221059.2(245_M732 
rasa221059.2{245_M781 
Consensus 



msa221059.2{245__H36B] 
msa221059 . 2 { 245_JM9130013 ] 
msa221059.2{245_1169NT 
msa221059.2{245_090; 
msa221059 . 2 { 245J2JB110 ; 
msa221059.2{24S_18RS2l} 
msa221059.2{245_2603} 
msa221059.2{245_A909) 
msa221059.2{24S_COHl} 
msa2210S9.2f245JM732} 
msa221059.2{245_M78l} 
Consensus 



msa221059.2{245_H36B} 
msa221059.2{245_JM9130013} 
msa221059.2{245_1169NT" 
msa221059.2{245_090 
msa221059 . 2 f 245_CJB110 
msa221059 . 2 {245_JL8RS21 

msa221059.2{245 2603 

msa221059.2 

msa221059.2 

msa221059.2 

msa221059.2 



■AA 

AA 

AA 

atgcttatga caaaaataat aggactgaca ggagggatag cttctggaAA 

— ■ — AA 

AA 

~~ AA 

-~ AA 

********** ********** ********** ********** ********** 



245_A909 
245_COHl'} 
245_M732} 
245_M78l} 
Consensus 



msa221059.2{245_H36B} 
msa221059.2{245_JM9130013} 
msa221059 . 2 { 245_1169NT} 
msa221059.2{245_090} 
msa221059.2(245_CJB110} 
msa221059.2{245_18RS2l) 
msa221059. 2 {245^2603} 
msa221059.2{245_A909} 
msa221059 . 2 { 245_COHl} 
msa221059.2(245_M732} 
msa221059.2{245_M78l} 
Consensus 



msa221059 .2{245_H36B} 
msa221059 . 2 {245_JM9130013 } 
msa221059 . 2 {245_1169NTl 
msa221059 . 2 {245_090 , 
msa221059.2{245_CJB110 
msa221059 . 2 {245_18RS2i; 
msa221059.2f 245_2603 
msa221059.2{245_A909. 
msa221059 . 2 ( 245_COHl } 
rnsa221059.2{245_M732} 
msa221059.2{245_M78l} 
Consensus 



msa221059.2{24S_H36B} 
msa22l059.2{245_JM9130013} 
mS a221059.2{245 < _1169NT} 
msa221059.2{245_090) 
m sa221059 . 2 (245_CJB110) 
mS a221059 . 2 {245_18RS2l} 
msa221059.2f24S__2603} 
msa221059 . 2 ( 245_A909 } 
msa221059.2(245_COHl} 
. msa221059.2{245_M732} 



51 

GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
GTCAACGGTA 
********** 

101 

CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
CGGATCAAGT 
********** 

151 

GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
GCTTTATTAG 
********** 

201 

GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
GGATAGACCA 
********** 

251 

AGACATCAGC. 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 
AGACATCAGC 



ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
ACAAAAATAA 
********** 



TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
TACGAGAATC 
********** 



GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
GGTTCATAAA 
********** 



AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
AATGGTTGGG 
********** 



TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 
TTGCAAGCTA 



AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
AGGTTTTAAA 
********** 



AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 
AGGGTGGGAA 



100 

GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
GTCATAGATG 
********** 

150 

ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 
ACTTTACCAA 



********** ********** ********** 



TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
TCCCGAGATA 
********** 



AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
AAGCTTTCTC 
********** 



TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 
TAGGCTACAA 



AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
AAATGATTTT 
********** 



CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
CTTGATGCTG 
********** 



TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
TGCTAATCCA 
********** 



AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 
AATAGTATCA 



TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 
TTCGTCAAGA 



200 

ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
ATGGTGAGTT 
********** 

250 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA 

GACAATATGA ' 

GACAATATGA 

GACAATATGA 

********** 

300 

GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
GTTAGCATGT 
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msa221059.2{245 M78l} AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT 
Consensus ********** ********** ********** ********** ********** 

350 

AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCg TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCg TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
AATTAAAACA AACAGAAGAG ATATTTTTCa TGGATATTCC 
*********_ ********** 



msa221059.2{245_H36B 
msa221059 . 2 { 245_JM9130013 
msa221059.2{245_1169NT} 
msa221059. 2 {245_090} 
msa221059 . 2 { 245_CJB110 } 
msa221059.2{245_18RS2l} 
msa221059 .2{245_2603} 
msa221059.2{245_A909} 
msa2210S9 . 2 f 245_COHl } 
msa2 21059.2{245_M73 2} 
msa221059.2{245_M78l} 
Consensus 



301 

CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 



********** ********** ********** 



msa221059 
msa221059.2{245 
msa221059.2{ 
msa221059 
msa221059 
msa221059 

rasa221059 . 

msa221059. 

msa221059. 

msa221059 . 

msa221059 . 



2{245_H36B} 
_JM9130013} 
245_1169NT} 
.2{245_090} 
.2{245_CJB110} 
.2{245_18RS21> 
2 { 245_2603) 
2{245_A909j 
2{245_COHl) 
2{245_M732} 
2{245_M78l} 
Consensus 



msa221059.2{245_H36B} 
msa221059.2{245_JM9130013} 
msa221059 . 2 { 245_1169NT} 
msa221059.2{245_090} 
msa221059 . 2 {245_CJB110 } 
msa221059 - 2 { 245_18RS21 } 
msa221059.2{245_2603j 
msa221059.2{245_A909) 
msa221059.2{245_COHl} 
msa221059.2{245_M732} 
msa221059.2{245_M78l} 
Consensus 



351 

TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
********** 

401 

TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
TATTTGTTGA 
********** 



msa221059 
msa221059.2{245 
msa221059.2{ 
msa221059 
msa221059 
msa221059 

msa221059 

msa221059 . 

msa221059. 

msa221059 . 

msa221059. 



2{245_H36B> 
_JM9130013} 
245_1169NT} 
.2{245_090} 
.2f 245_CJB110> 
.2{245_18RS21} 
2 {245^2603} 
2{245_A909} 
2{245__COHl} 
2{245_M732} 
2{245_M78l} 
Consensus 



msa221059.2{245_H36B} 
msa221059.2{245_JM9130013} 
msa221059.2{245_1169NT} 
msa221059.2{245_090} 
msa221059.2{245_CJB110} 
msa221059.2{245_18RS2l| 
msa221059 . 2 { 245_2603 } 
msa221059 . 2 { 245_A909} 
msa221059 . 2 { 245_C0H1 } 
msa221059.2(245_M732) 
tnsa221059.2{.245_M78l} 
Consensus 



msa221059. 
msa221059.2{245 
msa221059.2{ 
msa221059 
msa221059. 
msa221059 
msa221059. 
msa221059 . 
msa221059. 



GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 



ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 



GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 



400 

ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 



********** ********** ********** ********** 



TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
********** 



CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 



GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 



********** ********** 



2{245_H36B} 
_JM9130013} 
245_1169NT} 
„ J ^.2{245_090j 
.2{245_CJB110} 
.2{245_18RS21} 
2{245_2603} 
2f 245_A909> 
2{245_COHl} 



451 

TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
********** 

501 

AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
AGATAAAAAA 
********** 

551 

TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 



AAGAAGCgGA 
AAGAAGCgGA 
AAGAAG CaGA 
AAGAAGCaGA 
AAGAAG CaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 



ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 



TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 



450 

CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
********** 

500 

TaCCTTTAAC 
TaCCTTTAAC 
TaCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 



*******-** ********** ********** *_******** 



AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
********** 



AGAGCAAATg 
AGAGCAAATg 
AGAGCAAATg 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 



GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 
GTCTTATTAT 



TgAtAATAAT 
TgAtAATAAT 
TgAtAATAAT 
TaAtAATAAT 
TaAtAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 



550 

GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 



********** *_*_****** ********** 



TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 



591 

TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 



900 



WO 2004/018646 



Table 57: Comparative Sequences relating to SAG 1488 



msa221059 .2 (245 M732} TAACTTTAAA AGAGCAAATa TTGGATGCTC TTCAACGTTT A 
tnsa221059.2{245_M78l} TAACTTTAAA AGAGCAAATa TTGGATGCTC TTCAACGTTT A 
Consensus ********** *********- ********** ********** * 

SEQ ID NO: 5712 
STRAIN 2603 frame: 1 

MLMTKI IGLTGGIASGKSTVTKI IRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEI 
LDADGELDRPKLSQMI FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLL I 
EEKYIKWFDE IWLVFVDKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLI IDNN 
GDLITLKEQI LDALQRL 

SEQ ID NO: 5713 
STRAIN 090 frame: 1 

KSTVTKI I RESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPE I LDADGELDRPKLSQMI 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFVDI PLL I EEKY IKWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLI INNNGDLITLKEQI LDALQR 
L 

SEQ ID NO: 5714 
STRAIN A909 frame: 1 

KSTVTKI I RESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPE I LDADGELDRPKLSQMI 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLL I EEKY I KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR 

L 

SEQ ID NO: 5715 
STRAIN H36B frame: 1 

KSTVTKI IRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLL I EEKY I KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQ I PLTDKKS FASLI IDNNGDLITLKEQMLDALQR 
L 

SEQ ID NO: 5716 
STRAIN 18RS21 frame: 1 

KSTVTKI IRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQM I 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLL I EEKY I KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKS FASL I IDNNGDL ITLKEQI LDALQR 
L 

SEQ ID NO: 5717 
STRAIN M732 frame: 1 

KSTVTKI IRESGFKVIDADQWHKLQAKGGKLYQALLEWLGPE I LDADGELDRPKLSQM I 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLL I EEKY I KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKS FASLI IDNNGDL ITLKEQI LDALQR 
L 

SEQ ID NO: 5718 
STRAIN COH1 frame: 1 

KSTVTKI I RE SGFKVI DADQWHKLQAKGGKLYQALLEWLGPE I LDADGELDRP KLS QM I 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLLIEEKYI KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKS FASL I IDNNGDL ITLKEQI LDALQR 

L 

SEQ ID NO: 5719 
STRAIN M781 frame: 1 

KSTVTKI IRE SGFKV I DADQWHKLQAKGGKLYQALLEWLGPE I LDADGELDRPKLS QM I 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLLIEEKYI KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLI IDNNGDL ITLKEQI LDALQR 
L 

SEQ ID NO: 5720 
STRAIN CJB1 10 frame: 1 

KSTVTKI IRESGFKVIDADQWHKLQAKGGKLYQALLEWLGPE I LDADGELDRPKLSQMI 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFVDI PLLIEEKYI KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLI INNNGDLITLKEQI LDALQR 
L 

SEQ ID NO: 5721 

STRAIN 1 169NT frame: 1 

KSTVTKI I RE SGFKV I DADQWHKLQAKGGKLYQALLEWLGPE I LDADGELDRP KLSQM I 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLLIEEKYI KWFDE I WLVFV 
DKEKQLQRLMARNNY SREEAELRLSHQI PLTDKKS FASL I IDNNGDLITLKEQMLDALQR 
L 

SEQ ID NO: 5722 
STRAIN JM9 130013 frame: 1 

KSTVTKI IRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI 
FANPDNMKTSARLQNS 1 1 RQELACQRDQLKQTEE I FFMD I PLLI EEKY I KWFDE I WLVFV 
DKEKQLQRLMARNNYSREEAELRLSHQI PLTDKKS FASLI IDNNGDLITLKEQMLDALQR 

L 



901 



WO 2004/018646 



PCT/US2003/026827 



Table 57: Comparative Sequences relating to SAG 1488 



PRETTY of: /biotmp/msa221398 . 2 { * } February 10, 2003 07:15 



1 

msa221398 . 2 {245__090} KSTV TKIIRESGFK 

msa2213 98 . 2 { 245_CJB110 } KSTV TKIIRESGFK 

msa221398 . 2 { 245_1169NT) KSTV TKIIRESGFK 

msa221398 .2{245_H36B} KSTV TKIIRESGFK 

msa221398 . 2{245_JM9130013 } KSTV TKIIRESGFK 

msa221398.2{245_18RS2l} KSTV TKIIRESGFK 

msa221398 .2{245_2603} mlmtkiiglt ggiasgKSTV TKIIRESGFK 

msa221398 .2{245_A909} KSTV TKIIRESGFK 

msa221398'.2{245_COHl} KSTV TKIIRESGFK 

msa221398 .2{245_M732} KSTV TKIIRESGFK 

msa221398 . 2 { 245__M78 1 } KSTV TKIIRESGFK 

Consensus ********** ********** ********** 



VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 
VIDADQWHK 



50 

LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 
LQAKGGKLYQ 



********** ********** 



msa221398 
msa221398 
msa221398 
msa221398 
msa221398 .2(245 
msa221398.2{ 
msa221398 
msa221398 
msa221398 . 
msa221398 . 
msa221398 . 



-v^w .2{245_090} 
.2{245_CJB110} 
.2{245_1169NT} 
2{245_H36B} 
_JM9130013} 
245_18RS2l} 
2(245_2603] 
2{245_A909j 
2{245__COHl] 
2{245_M732] 
2{245_M781] 
Consensus 



51 

ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 
ALLEWLGPEI 



LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 
LDADGELDRP 



KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 
KLSQMIFANP 



DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 
DNMKTSARLQ 



100 

NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 
NSIIRQELAC 



********** ********** ********** ********** ********** 



msa221398 
msa221398 
msa221398 

msa221398 . 
msa221398.2{245 
msa221398.2{ 

msa221398 . 

msa221398 . 

msa221398 . 

msa221398 . 

msa221398 . 



.2{245_090j 
.2{245_CJB110} 
.2{245_1169NT} 
2{245_H36B} 
_JM9130013} 
245_18RS21} 
2{245_2603} 
2{245_A909j 
2{245_COHlJ 
2(245__M732} 
2{245_M78l} 
Consensus 



101 

QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 
QRDQLKQTEE 



IFFvDIPLLI 
IFFvDIPLLI 
IFFmDIPLLI 
IFFtnDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 
IFFmDIPLLI 



EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 
EEKYIKWFDE 



I WLVFVDKE K 
IWLVFVDKEK 
I WLVFVDKE K 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 
IWLVFVDKEK 



150 

QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 
QLQRLMARNN 



151 

msa221398.2{245_090} YSREEAELRL SHQmPLTDKK SFASLIInNN 

msa221398.2{245_CJB110} YSREEAELRL SHQmPLTDKK SFASLIInNN 

msa221398.2{245_1169NT} YSREEAELRL SHQiPLTDKK SFASLIIdNN 

msa221398 .2{245_H36B} YSREEAELRL SHQiPLTDKK SFASLIIdNN 

msa221398.2{24S_JM9130013} YSREEAELRL SHQiPLTDKK SFASLIIdNN 

msa221398.2{245_18RS21> YSREEAELRL SHQmPLTDKK SFASLIIdNN 

msa221398 .2{245_2603) YSREEAELRL SHQmPLTDKK SFASLIIdNN 

msa221398 .2{245_A909} YSREEAELRL SHQmPLTDKK SFASLIIdNN 

msa221398 .2{245_C0H1} YSREEAELRL SHQmPLTDKK SFASLIIdNN 

msa221398 .2{245_M732} YSREEAELRL SHQmPLTDKK SFASLIIdNN 

msa221398 .2{245_M781} YSREEAELRL SHQmPLTDKK SFASLIIdNN 

Consensus ********** ***_****** *******_** 



GDLITLKEQi 
GDLITLKEQi 
GDLITLKEQm 
GDLITLKEQm 
GDLITLKEQm 
GDLITLKEQi 
GDLITLKEQi 
GDLITLKEQi 
GDLITLKEQi 
GDLITLKEQi 
GDLITLKEQi 



197 
LDALQRL 
LDALQRL' 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 
LDALQRL 



*********_ ******* 



902 



WO 2004/018646 



PCT/US2003/026827 



Table 58: Comparative Sequences relating to SAG0182 



SEQ ID NO. 5801 
STRAIN 2603 

ATGTTGATGGTGTTGTTAT T C CAAAGG CT AGG AATTATT ATGATTTTAG C CTTTTTATT G 

GTAAATAA.TAGTTATTTTAGAGAGTTAATTGAAGAGCGGTCTAAACGTGAAACGGTAGTC 

CTTGTCAT CATTTT CGG CTTGTTTGTT ATT AT ATCTAAT ATAACAGG AATTG AAATAAAA 

GGGGATCGAAGTTTGGTCGAGCGCCCTTTTCTAACAACGATTTCT 

GCTAATACAAGGACTTTAGTTATTACAACGGCAAGTTTGGTTGGTGGACCr 

TCAATTGTTGGTTT^ATTGGAGGAGTTCATCGCT 

TTCTATATTGTCAGTTCAGTT CT AGT CGG CATTGTTAG CGGAAAGATTGGTG AT AAG CTT 
AAGGAAAACCAT CT CTACC CTT CAACAAG CCAAGTTATTTTAATTAGTATTATTGCCGAA 
AGTAT CCAGATG CTATTTGTTGG CATTTTTACAGG ATGGGAACTTGT CAAAATG ATTGT C 
ATTCCAATGATGATTTTAAATAGTTTAGGTTCCACACTTT^ 

TATTTGTCAAATGAAAGTCAGTTACGCGCAGTTCAAACGAGAGATGTTCTTGAATTGACT 
CGACAGACTCTG CC CT ACCTTAGACAAGGTTTGACAC CG C^TCT GCTAGGAG CGTTTG C 
GAAATTATAAAGAG G CATACTAACTTTG ATG CTGTGGGAT TAACAGATCGGT CAAACGTA 
TTAGCTCATATTGGTGTTGGCCATGATCACCATATTGCAGGACAACCGGT 
TT AT CTAAAAGTGTT ATTTTTGATGG CG AAC CAAGAATTG CG CAAGATAAAGCGGCGATT 
TCTTGTC CAG AT CACAACTGTCAGTTAAATTCTG CTATTGTAGTTCCT CT AAAAATAAAT 
GATAAAACT GTGGGTG C CT^AAAAATGTACTTTG CAGGAGATAAGACAATGT CTGAGGTG 
GAGGAAAACCTAGT CCTTGGTTTAG CG CAAATATTTT CAGGACAACTGG CAATG GGGATA 
ACAGAGGAACAAAATAAGTT AG CCAGT ATGG CAG AGATAAAGGCTTTACAAG CACAAATC 
AACC CTCATTT CTT CTTTAATG CCATTAACACAATTAGTG CATTAAT CCGTATTGATTCT 
GATAAAG CACGTTATG CACTGATG CAGTTAAGTACTTTTTTT AGAACAAGTTTG CAGGGT 
GGTCAGGAT CGTGAGGTAAC G CTTG AG CAAGAAAAATCACATGTG GATG CTTATATGAAT 
GTTGAAAAATTACGTTT CC CTG AT AAA! ATCAGTTATCT^ 

AAAATGAAGTTACCACCITTTGGTTTACAGGTACTGGTAGAGAATGCAGTTCG^ 

TT CAAAGAACGTAAGACGGACAAC CAT ATATTGGTT CAAATAAAG CCAGATGGT CATTAT 

TATTGTGTTT CTGTTAGTGACAATGGACAAGGAATCTCAGATACTAT CATTGATAAATTA 

GGTCAAGAAACAGTTGCAGAGAGTAAGGGTACAGGTACTC 

AGGCTGAATTTATTATATGGTAGTGTAAGTTGCCTTCATTTT^ 

ACAAAAGTTTGGTAT CGAAT ACCTAAT AGAATAAGGGAGGATGAG CATG AAAATTTTAAT 
TCT 



SEQ ID NO. 5802 
STRAIN 090 

TTGATGGTGTTGTTATT CCAAAGG CTAGGAATTATTAT 
GATTTTAGCCTTTTTATTC 

AAGAGCGGTCTAAACGTGAAACGGTAGTACTTGTCIATCATTTTCGGCTTG 
TTTGTTATT ATATCTAATATAACAGGAATTGAAAT AAAAGGGGAT CG AAG 
TTTGGTCGAGCGCCCrrTTTCTAAC^CGATTTCCCAT^ 
CTAATACAAGGACTTTAGTTATTACAACGGCAAGTTTGG 
CTGGTTGGAT CAATTGTTGGTTTTATTGGAGGAGTTCAT CGC1T"1TTTCA 
AGGAAGCTTTTCAGGTT CTTT CTATATTGT CAGTTCAGTT CTAGT CGG CA 
TTGTTAGCGGAAAGATTGGTGATAAGCTTAAGGAAAACCATCTCTACCCT 
TCAACAAG C CAAGTTATTTTAATTAGTATT ATTG CCGAA&GTATCCAGAT 
GCTATTTGTTGGTATTTTTACAGGATGGG AACTTGT CAlAAATGATTGTCA 
TT CCAATGATGATTTTAAAT AGTTTAGGTT C CACACTTTT CCTTGCGATT 
TTGAAAACITATTTGTCAAATGAAAGT C^GTT ACGCG CAGTT CAAACGAG 
AGATGTTCITGAATTGACT CGACAG ACT CTGCCCTAC CT CAGACAAGGTT 
TGACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACT 
AACTTTGATG CTGTAGGATTAACAGATCGGT CAAACGTATTAGCT CATAT 
TGGTGTTGG CCATGATCAC CATATTGCAGGACAACCAGT CAAAACAGAC C 
TATCTAAAAGTGTTATTTTTGATGGCGAAC CAAGAATTG CGCAAGAT AAA 
GCGGCGATTTCTTGT C CAGAT CACAACTGTCAGTTAAATT CTGCTATTGT 
AGTTCCTCTAAAAATAAATGATAAAACIGTGGGTGCCTTAAAAATGTACT 
TTGCAGGAGATAAGACAATGTCrGAGGTGGAGGAAAAC CTAGTC CTTGGT 
TTAG CX3CAAATATTTTCAGGACAACTGGCAATGGGGATAACAGAG 
AAATAAGTTAGC CAGTATGG CAGAGATAAAGG CTTTACAAGCACAAATCA 
AC CCTCATTT CTTCTTTAATGC CATTAACACAATTAGTG CATTAATCCGT 
ATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACITTT^ 
TAGAACAAGTTTGCAAGGTGGT CAGGATCGTG AGGT AACG CTTG AG CAAG 
AAAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCT 
GATAAATAT CAGTTATCTTATGATATTAGTG CAC CAGAAAAAATGAAGTT 
ACCGCCTTTTGGTTTACAGGTACrGGTAGAGAATGCAGTTAGACA.TGCT 
TCAAAGAACGTAAGACGGACAAC CATAT ATTGGTT CAAATAAAG CCAGAT 
GGTCATTATTATTGTGTTTCTGTTAGTGACAATGG ACAAGGAAT CT CAGA 
TACTATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGTA 
CAGGTACTGCTCTAGTTAATCrAAATAACAGGCTGAATT^ 
AGTGTAAGTTGC CTT CATTTTT CGAGCGAGAAGAATGGTACAAAAGTTTG 
GTAT CGAAT ACCTAATAGAATAAGGGAGGATGAG CATGAAAATTTTAATT 
CT 



SEQ ID NO. 5803 
STRAIN A909 

gattttagcc"1ttttattggtaaataatagttatttcagacagtt aattg 

aagagcggtctaaacgtgaaacggtagtccttgtcatcattttcggcttg 

tttgttattatatctaatataacaggaattgaaataaaaggggatcgaag 

tttggtcgagcgcccttttctaacaacgatttctca^ 

ctaatacaaggactttagttattacaacggcaagtttggttggtgg^ 

ctggttggatcaattgttggttttattggaggagttcatcg^ 

aggaagcttttcaggttctt^ctatattgtcag 

TTGTTAGCGGAAAGATTGGTGATAAGCTTA^GGAAAACCATCTCTACCCT 
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TCAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGAAAGTATCCAGAT 
GCTATTTGTTGGCATTTTTACAGGATGGGAACTTGTCAAAATGATTGTCA 
TTCCAATGATGATTTTAAATAGTTTAGGTTCCA.C^CTTTTCCTTGCGATT 
TTGAAAACTTATTTGTCAAATGAAAGTCAGTTACG CGCAGTTCAAACGAG 
AGATGTT CTT GAATTGACT CGACAGACT CTG CCCT AC CTTAG ACAAGGTT 
TGACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACT 
AACTTTG ATG CTGTGGGATTAACAG AT CGGTCAAACGTATTAG CT CAT AT 
TGGTGTTGGCCATGATCACCATATTGCAGGACAACCGGTCAAAACAGACT 
TATCTTAAAAGTGTTATTTTTGATGGCGAACCAAGAATTGCG CAAGATAAA 
GCGGCGATTTCTTGTCCAGATCACAACTGTCAGTTAAATTCTGCTATTGT 
AGTTCCrCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACT 
TTGGAGGAG ATAAGACAAT GTCTG AGGTGG AG GAAAACCT AGT C CTTGGT 
TTAG CG CAAATATTTTCAGGACAACTGG CAATGG G GATAACAGAGGAACA 
AAAT AAGTTAGC CAGTATGG CAGAGATAAAGGCTTTACAAG CACAAATCA 
AC C CT CATTT CTT CTTTAATGC CATTAACACAAT TAGTG CATTAAT C CGT 
ATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTAC'r'iTTT'T 
TAGAACAAGTTTGCAGGGTGGT CAGGATCGTGAGGTAACG CTTG AGCAAG 
AAAAATGACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCT 
GATAAATATCAGTTATCTTATGATATT AGTG CAC CAG AAAAAATGAAGTT 
ACCACCTTTTGGTTTAC^C4GTACTGGTAGA 

T CAAAGAAC GTAAG ACGGACAACCATATATTGGTT CAAAT AAAG C CAGAT 
GGTCATTATTATTGTGTTT CTGTTAGTGACAATGGACAAGGAAT CT CAG A 
TACTATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGTA 
CAGGT ACTG CTCTAGTTAAT CT AAATAACAGG CTGAATTTATTAT ATGGT 
AGTGTAAGTTGCCTT CATTTTT CGAGCGACAAGAATGGTACAAAAGTTTG 
GT AT CGAATACCTAATAGAATAAGGGAGG ATGAG CATGAAAATTTTAATT 
CT 

SEQ ID HO. 5804 
STRAIN H36B 

TTGATGGTGTTGTTATTCCAAAGGCTAGGAATTATTATG 
ATTTTAGCCTTTTTATTGGTAAATAATAGTTATTTCAGAC^ 
AGAGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCG^CTTGT 
TTGTT ATTATAT CTAAT ATAACAGGAATTG AAAT AAAAGGGG AT C GAAGT 
TTGGTCGAGCGCCCTTTTCTAA.CAACGATTTCTCATTCTGACT 
TAATACAAGGACTTTAGTTATTACAACGG(^AAGTTTGGT^ 
TGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTT CATCG CTTTTTTCAA 
GG AAGCTTTTCAGGTTCTTT CTAT ATTGT CAGTT CAGTTCTAGT CGGCAT 
TGTTAGCGGAAAGATTGGTGATAAGCTTAAGGAAAACCATCTCTACCCTT 
\ CAACAAG CCAAGTTATTTTAAT TAGTATTATTG C CGAAAGTATC CAGATG 

CTATTTGTTGGCATTTTTACAGGATGGG^ 

T C CAATGATGATTTTAAATAGTTTAGGTT CCAQA.CUT H iTCCTTG CGAT^ 

TGAAAACTTATTTGTCAAATGAAAGTCAGTTACG CGCAGTT CAAACGAGA 

GATGTTCTTGAATTGACTCGACAGACTCTGCCCTACCTTAGA 

GACAC CG CAATCTGCTAGGAGCGTTTG CG AAATT ATAAAG AGG CATACTA 

ACITTGATGCrGTGGGATTAACAGATCGGTCAA 

GGTGTTGGCCATGATCACCATATTGCAGGACAACCGGT 

ATCTAAAAGTGTTATTTTTGATGGCGAACCAAGAATTGCGCAAGATAAAG 

CGGCGATTTCTTGTCCAGATCACAACTGTCAGTTAAATTOT 

GTTCCrCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACTT 

TGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTGGTT 

TAGCGCAAATATTTTCAGGACAACTGG CAATGGGG ATAACAGAGGAACAA. 

AATAAGTTAG CCAGTATGGCAGAGATAAAGG CTTTACAAG CACAAAT CAA 

CCCT CATTT CTT CTTTAATG CCATTAACACAATT AGTG CATTAAT CCGT A 

TTGATTCTGATAAAG CACGTTATG CACTGATG CAGTTAAGTACiTTTTTT 

AGAACAAGTTTGCAGGGTGGTCAGGATCGTGAGGTAA.CGCTTGAGCAAGA 

AAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTT CC CTG 

ATAAATATCAGTTAT CTTATGATATTAGTG CACCAGAAAAAATGAAGTT A 

CCACCTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTT CGAC^ 

CAAAGAACGTAAGACGGACAAC CATATATTGGTT CAAATAAAG C CAGATG 

GT CATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCT 

ACTATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGTAC 

AGGTACTG CT CTAGTTAAT CTAAATAACAGG CTGAATTTATTAT ATGGTA 

GTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTA(2AAAAGTT^ 

TATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAATTC 

T 

SEQ ZD NO. 5805 
STRAIN 18RS21 

TTGATGGTGTTGTTATTCCAAAGG CTAGGAATTATTATG 

ATTTTAG C CTTTTTATTGGTAAAT AATAGTTATTTTAGACAGTTAATTG A 

AGAGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGGCTTGT 

TTGTT ATTATAT CT AATATAACAGGAATTGAAATAAAAGGGGAT CGAAGT 

TTGGTCGAGCGCCCITTTCTAACMCGATTTCT 

TAATACAAGGACTTTAGTTATTACAACGG CAAGTTTGGTTGGTGGACCTC 
TGGTTGGAT CAATTGTTGGTTTTATTGGAGGAGTTCATCG CTTTTTT CAA. 
GG AAG CTTTT CAGGTTCTTT CTATATTGT CAGTT CAGTT CTAGTCGG CAT 
TGTTAG CGG AAAGATTGGTG ATAAG CTTAAGG AAAAC CATCT CTAC CCTT 
CAACAAG CCAAGTTATTTTAATTAGTATTATTG C CGAAAGTATC CAGATG 
CTATTTGTTGGCATTTTTACAGGATGGGAACT 

TCCAATGATGATTTTAAATAGTTT AGGTT C CACACTTTT C CTTG CGATTT 
TGAAAACTTATTTGTCAAATGAAAGTCAGTTACGCGCAGTTC^UyVCGAGA 
GATGTTCn?TGAATTGACrCGACAGACTCT 
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GACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACTA 
ACTTTGATGCTGTGGGATTAACAGATCGGTCAAACGTATTAGCTCATATT 
GGTGTTGGCCATGATCACCATATTGCAGGACAACCGGTCAAAACAGACTT 
AT CTAAAAGTGTTATTTTTG ATGG CGAAC CAAG aATT G CG CAAG ATAAAG 
CGGCGATTTCTTGTCCAGATCACAACTGTCAGTTAAATTCTGCTATTGTA 
GTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACTT 
TGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTGGTT 
TAGCGCAAATATTTTCAGGACAACTGGCAATGGGGATAACAGAGGAACAA 
AATAAGTTAGCCAGTATGGCAGAGATAAAGGCTTTACAAGCACAAATCAA 
C C CT CATTT CTT CTTTAATG CC ATT AACACAATTAGTGCATTAAT CCGTA 
TTGATTCTGATAAAG CACGTTATG CACTGATG CAGTT AA3TACTTTTTTT 
AGAACAAGTTTGCAGGGTGGTCAGGATCGTGAGGTAACGCTTGAGCAAGA 
AAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCTG 
ATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAGTTA 
CCACCTTTTGGTTTACAGGTACTGGTAGAGAATGCAGTTCGACZATGCTTT 
CAAAGAACGTAAGACGGACAAC CATATATTGGTTCAAAT AAAG C CAGATG 
GTCATTATTATTGTGTTTCrrGTTAGTGACAATGGACAAGGAATCTCAGAT 
ACTAT CATTGATAAATTAG GTCAAG AAACAGTTG CAGAG AGTAAGGGTAC 
AGGTACTGCT CTAGTTAATCTAAAT AACAGG CTGAATTTATTATATGGTA 
GTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTTTGG 
TATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAATTC 
T 

SEQ XD NO. 5806 
STRAIN M732 

TTGATGGTGTTGTTATTCCAAAGGCTAGGAATTATTATGAT 

TTTAG CCTTTTT ATTGGTAAATAATAGTTATTT CAGACAGTTAATTGAAG 

AGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGGCTTGTTT 

GTTATT ATAT CTAATATAACA.GGAATTGAAAT AAAAGGGG ATCGAAGTTT 

GGTCGAGCGCCC"I w ri"rCTAACAACGATTTCCCAl'TCU'GACrCACTTGCTA 

AT ACAAGGACTTTAGTTATTACAACGG CAAGTTTGGTTGGTGG 

GTTGGAT CAATTGTTGGTTTTATTGGAGGAGTT CAT CG CTTTTTT CAAGG 

AAGCTTTTCAGGTTCTTTCTATATTGT CAGTTCAGTTCTAGTCGGCATTG 

TTAGCX3GAAAGATTGGTGATAAGCTOAAGGAAAACCATCTCTACCCTTCA 

ACAAG CCAAGTTATTTTAATTAGTATTATTG CCG A^iAGTAT C CAGATGCT 

ATTTGTTGG CATTTTTACAGGATGGGAACTTGT CAAAATG CATTC 

CAATGATGATTTTAAATAGTTTAGGTT CCACACTTTTCCTTG CGATTTTG 

AAAACTTATTTGT CAAATGAAAGTCAGTTACGCG CAGTT CAAACGAGAGA 

TGTT CITGAATTGACTCGACAG ACTCTGC C CTAC CTT AGACAAGGTTTGA 

CACCGCAATCrGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACTAAC 

TTTGATG CTGTGGGATT AACAGATCGGTCAAACGTATTAG CT CATATTGG 

TATTGGCCATGATCACCATATTGCAGGACAACraGTCSVAAACAGACTTAT 

CTAAAAGTGTTATTTTTGATGG CGAAC CAAGAATTG CG CAAGATAAAGCG 

GCGAtTTCTTGTCCAGATCACAACTGTCAGTTAAAITCT 

T C CTCTAAAAAT AAATGATAAAACTGT GTGTG C CTTAAAAATGTACTTTG 

CAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCITTGGTTTA 

GCG CAAATATTTTCAGGACAACTGG CAATGGGGATAACAGAGGAACAAAA 

TAAGTTAGCCAGTATGG CAGAGATAAAGG CTTTACAAGCACAAAT CAAC C 

CTCATTT CTT CTTTAATG C CATTAACACAATTAGTG CATTAATCCGTATT 

GATTCTGATAAAGCACGTTATGCACEX^ 

AACAAGTTTG CAAGGTGGTCAGGAT CGTG AGGTAACG CTTGAGCAAGAAA 
AAT(^CATGTGGATGCTTATATGAATGTTGAAA^TTACGTTTCCCTGAT 
AAATATCAGTTATCTTATGATATTAGTG CACCAGAAAAAATGAAGTTAC C 
GCCTTTTGGTTT ACAGGTACTGGTAGAGAATG CAGTT CGACATGCTTTCA 
AAGAACGTAAGACGGACAAC CATATATTGGTT CAAATAAAGC CAGATGGT 
CATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCT 
TATCATTGATAAATTAGGTCAAGA^CAGTTGCAGAGAGTAAGGGGACAG 
GTACTGCTCTAGTTAATCTAAATAACAGGCTGAATTTATTATATGGTAGT 
GT AAGTTGC CTT CATTTTT CGAGCGACAAG AATGGTACAAAAGTTTGGT A 
TCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAATTCT 

SEQ ID NO. 5807 
STRAIN COH1 

TTGATGGTGTTGTTATT C CAAAGGCTAGGAATTAT 

TATGATTTTAGCCTTTTTATTGGTA^TAATAGTTATTTCAGACAGTTAA 

TTGAAGAGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGGC 

TTGTTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCG 

AAGTTTGGT CGAGCG CC CTTTTCT AACAACGATTT C C CATTCTGACTCAC 

TTGCTAATACAAGGACTTTAGTTATTACAACGGCAA 

C CTCTGGTTGGATGAATTGTTGGTTTT ATTGGAG 

TCAAGGAAG CTTTT CAGGTT CTTT CTATATTGT CAGTT CAGTTCT AGTCG 
GCATTGTTAGCGGAAAGATTGGTGATAAG CTTAAGGAAAACCATCTCTAC 
C CTT CAACAAGCCAAGTTATTTTAATTAGTATTATTG CCGAAAGTATCCA 
GATG CTATTTGTTGG CATTTTTACAGG ATGGGAACTTGTCAAAATGATTG 
TCATT CCAATGATGATTTTAAATAGTTTAGGTT C CACACITTTC CTTG CG 
ATTTTGAAAACTTATTTGT CAAATGAAAGT CAGTTACGCG CAGTT CAAAC 
GAGAGATGTTCTTGAATTGACTCGACAGACT CTG C CCTAC CTTAGACAAG 
GTTTGACAC CGCAATCTGCT AGGAG CGTTTG CGAAATTATAAAGAGG CAT 
ACTAACTTTGATGCTGTGGGATTAACAGATCGGTCAAA 
TATTGGTGTTGG CCATGAT CAC CAT ATTG CAGGACAACCGGT CAAAACAG 
ACTTATCTAAAAGTGTT ATTTTTG ATGG CG AACCAAGAATTG CG CAAGAT 
AAAG CGG CGATTTCTTGT C CAGAT CACAACTGTCAGTTAAATTCTG CTAT 
TGTAGTTCCTCTAAAAATAAATGATAAAACTGTGTGTGCCTTAAAAATGT 
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ACTTTGCAGG AG AT AAG ACAATGT CTGAGGTGGAGGAAAAC CT AGT C CTT 
GGTTT AG CG GAAATATTTT CAGGACAACTGGCAAT GG GG AT AACAGAGG A 
ACAAAATAAGTTAG CCAGTATGG CAGAGATAAAG G CTTT ACAAG CACAAA 
TCAAC CCT CATTT GITCTTTAATGC CATTAACACAATTAGTG CATTAAT C 
CGTATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACTTT 
TTTTAGAACAAGTTTGCAAGGTGGTCAGGATCGTGAGGTAACGCTTGAGC 
AAGAAAAAT CACATGTGG ATG CTT ATATG AATGTTGAAAAATTACGTTT C 
CCTG ATAAAT AT CAGTTAT CTTAT G AT ATTAGTG CAC CAG AAAAAATGAA 
GTTAC CG GCTTTTGGTTTACAGGTACTGGT AGAG AATG CAGTT CGACATG 
CTTTCAAAGAACGTAAGACGGACAACCATATATTGGTTCAAATAAAGCCA 
GATGGTCATTATTATTGTGTTTCrGTTAGTGACAATGGACAAGGAATCTC 
AG ATACTATCATTG ATAAATTAGGT CAAGAAACAGTTG CAGAGAGTAAGG 
GG ACAGGTACTG CT CTAGTTAAT CTAAATAACAGG CTGAATTTATTAT AT 
GGTAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGT 
TTGGT AT CG AATAC CTAATAGAATAAGGG AGG AT GAG CATG AAAATTTTA 
ATTCT 

SEQ ID NO. 5808 
STRAIN M781 

TTGATGGTGTTGTT ATT C CAAAGG CTAGG AATTATTA 
TGATTTTAGCCTTTTTATTGGTAAATAATAGTT^ 

GAAGAGCGGT CTAAACGTGAAACGGTAGT C CTTGT CAT CATTTT CGG CTT 
GTTTGTTATTATAT CTAATATAACAGGAATTG AAATAAAAGGGGATCGAA 
GTTTGGTCXiAGCGCCCTTTTCTAACAACGATT^ 

GCTAATACAAG<^CTTTAGTTATTACAACGG CAAGTTTGGTTGGTGGAC C 
TCTGGTTGGATGAATTGTTGGTTTTATTGGAGGAGTTCATCGCT 
AAGGAAG CTTTT CAGGTTCTTT CT ATATTGT CAGTTCAGTT CTAGTCGGC 
ATTGTTAGCGGAAAGATTGGTGATAAG CTTAAGGAAAACCAT CTCTACC C 
TT CAACAAGCCAAGTTATTTTAATTAGTATTATTG CCGAAAGTAT CCAGA 
TGCTATTTGTTGGCATTTTTACAGGA^ 

ATTCCAATG ATGATTTT AAATAGTTTAGGTTCCACACIT"!"^ 
TTTGAAAACTTATTTGT CAAATGAAAGTCAGTTACGCG CAG t T CAAACGA 
GAGATGTT'ClTGAATTGACrCGACAGACrCTGCCCTACCTTAGACAAGGT 
TTGACACCX3CAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATAC 
TAAGTTTGATG CTGTGGGATTAACAGATCGGT CAAACGTATTAG CTCATA 
TTGGTGTTGGCCATGATCACCATAT/rGCAGGA 

TTATCTAAAAGTGTT ATTTTTGATGG CGAACCAAG AATTG CG CAAGATAA 
AG CGG CGATTTCTTGTCCAGATCACAACTGT CAGTTAAATT CTGCTATTG 
TAGTTCCTCTAAAAATAAATGATAAAACTGTGTGTGCCTTAAAAATGTAC 
TTTG CAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGT C CTTGG 
TTTAGCGO^AAT ATTTT CAGGACAACTGG CAATG 

AAAATAAGTTAGCCAGTATGGCAGAGATAAAGGCTTTACAAGCACAAATC 

AACCCTCATTTCTTCTTTAATGCCATTAACACAATTAGT 

TATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACl"l''l"lT 

TTAGAAGAAGTTTGCAAGGTGGTCAGGATCGTGAGGTAACGCTTGAGCAA. 

GAAAAATCACATGTGGATGCIT'ATATGAATGTTGAAAAATTACGTTTCCC 

TGATAAATATCAGTTAT CTTATGATATTAGTGCAC CAGAAAAAATGAAGT 

TACCGCCTTTTGGTTTACAGGTACTGGTAGAG 

TTCAAAGAACGTAAGACGGACAAC CAT ATATTGGTTCAAATAAAGCCAG A 

TGGTCATTATTATTGTGTTTCTGTTAGTGACAATGaA.CAAGGAATCT 

ATACTAT GATTGATAAATTAGGTCAAGAAACAGTTG CAGAGAGTAAGGGG 

ACAGGTACTGCTCTAGTTAATCTAAATAACAGGCTGAATTTATTATATGG 

TAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGT^ 

GGTATO^AATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAAT 

TCT 

SEQ ID KO. 5809 
STRAIN CJB1 10 

TTGATGGTGTTGTTATT C CAAAGG CTAGGAATTATT AT 
GATTTTAGCCTTTTTATTGGTAAATAATA 

AAGAG CGGT CTAAACGTGAAACGGTAGTACTTGT CAT CATTTTCGG CTTG 

TTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCGAAG 

TTTGGTCGAGCGCCCTTTTCTAACAACGATTTCCCATTCTGACT 

CTAATACAAGGACI^AGTTATTACAACGGCAAGTTTGGTTGGTGGACCT 

CTGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTTCATCGCTTT 

AGGAAGCTTTTCAGGTTCTTTCTATATTGTCAGTTCAGT^ 

TTGTTAG CGGAAAGATTGGTGATAAG CTTAAGGAAAACCATCTCTAC CCT 

TCAACAAGCCAAGTTATTTTAATTAGTATTAT^ 

GCTATTTGTTGGTATTTTTACAGGATGGGAACTTGTCAAAAT 

TT CCAATGATGATTTTAAATAGTTT AGGTT CCACACT u lT M iGCTTG CG ATT 

TTGAAAACITATTTGTCAAATGAAAGTCAGTTACGCX3CAGTTCAAACGAG 

AGATGTTCTTGAATTGACTCGACAGACrrCTGCCCTACCTCAGACAAG^ 

TGACACCGCAATCTG CTAGGAG CGTTTG CGAAATTATAAAGAGG CATACT 

AACIT'TGATGCTGTAGGATTAACAGATCGGTCAAACGTAT^ 

TGGTGTTGGCCATGATCACCATATTG(^GGACAACCAGTCAAAACAGACC 

TATCTAAAAGTGTTATTTTTGATGGCGAACCAAGAATTGCGCAAGATAAA 

GCGG CGATTT CTTGT C CAGATCACAACTGTCAGTTAAATTCTG CT ATTGT 

AGTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACT 

TrGCAGGAGATAAGACTVATGTCTGAGGTGGAGGAAAACCTAGTCCrrTGGT 

TTAGCGCAAATATTTTCAGGACAACTCGC^^ 

AAATAAGTTAGCCAGTATGG CAGAGATAAAGG CTTTACAAG CACAAAT CA 
AC C CT CATTTTTT CTTT AATG C CATTAACACAATTAGTG CATTAAT C CGT 
ATTGATT CTGATAAAG CACGTTATG CACTGATG CAGTTAAGT ACTTTTTT 
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TAGAACAAGTTT G CAAGGT GGT CAGGATCGTG AGGTAACG CTTG AG CAAG 
AAAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCT 
GATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAGTT 
ACCX3CCTTTTGGTTTACAGGTACTGGTAGAGAATGCAGTTAGACATGCTT 
T CAAAGAACGTAAG ACGG ACAACCATATATTGGTT CAAAT AAAG C CAGAT 
GGT CATT ATT ATTGTGTTT CTGTT AGTGACAATGG ACAAGG AAT CTCAG A 
TACTATCATTGATAAATTAGGT CAAGAAACAGTT G CAGAG AGTAAGG GTA 
CAGGTACTG CT CTAGTT AAT CT AAATAACAGG CTGAATTT ATTAT ATGGT 
AGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTT^ 
GT AT CGAATACCTAATAGAATAAGGGAGGATGAG CATGAAAATTTTAATT 
CT 

SEQ ID MO. 5810 
STRAIN 1169NT 

TTGAT GGTGTTGTTATT C CAAAGG CTAGGAATTATT 

ATGATTTTAG CCTTTTT ATTGGTAAA.TAAT AGTT ATTT CAGACAGTT AAT 
TGAAG AG CGGTCTAAACGTG AAACG GTAGTACTT GT CAT CATTTT CGG CT 
TGTTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCGA 
AGTTTGGTCGAGCGCCCITTTCTAACAACGATTTCT 

TGCTAATACAAGGACITTAGTTATTACAACGGCAAGTTTGGTTGGTGGAC 
CT CTGGTTGG ATCAACTGTTGGTTCTATTGGAGGAGTT CATCGCTTTCT T 
CMGGAAGCTTITCAGGTTCTTTCTATATTGTCAGrr 
CATTGTG AG CGGAAAGATTGGTGATAAG CTTAAGGAAAAC CAT CT CTACC 
CITGAACAAG CCAAGCTATTTTAATTAGTAT^ CG AAAGT AT CCAG 
ATGCTATTTGTTGG CATTTTTACAGGATGGGAACT CAAAATG ATTGT 
CATT C CAATGATGATTTTAAATAGTTTAGGTTCCACACTTTTCOT 
TTTTGAAAACTTATTTGTCAAATGAAAG 

AGAGATGTT CTTGAATTGACT CGACAGACT CTGC C CTAC CTTAGACAAGG 

TTTGACACCGCAATCTGCTAGGAGCGTTTGCGAAACTATAAAGA 

CTAATTTTGATGCTGTGGGATTAACAGATCGGTCAAACXiTAT^ 

ATTGGTGTTGGCCATGATCACCATATTGCAGGACAACCAGTCA 

C CTATCT AAAAGTGTTATTTTTGATGG CX3AACCAAGAATTGCGCAAG ATA 

AAGC GGCGATTT CTTGTCCAGATCACAACTGT CAG'ITAAATT CTG CTATT 

GTAGTTCCTCTAAAAATAAATGATAAAACTGTGGGTG CCTTAAAAATGT A 

CTTTG CAGGAGAT AAGACAATGTCTGAGGTGGAGG AAAAC CTAGT CCTTG 

GTTTAGCGCAAATATTTTCAGG ACAACTGG CAATGGGGATAACAGAGGAA 

CAAAATAAGTTAGC CAGTATGG CAGAG AT AAAGG CTTTACAAG CACAAAT 

CAACCCTCATTT CTT CTTTAATGCCATTAACA.CAATT AGTGCATTAAT C C 

GTATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACTTTT 

TTTAGAA.CAAGTTTG CAAGGTGGT CAGGATCGTG AGGTAACG CTTGAGCA. 

AGAAAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCC 

CTGATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAG 

TTAC CG C CTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTT CGACATG C 

TTTTAAAGAACGTAAGACGGACAAC CATATATTGGTTCAAATAAAGCCAG 

ATGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAA.GGAATCTCA 

GATACTATCATT GATAAATTAGGTCAAGAAACAGTTG CAGAGAGTAAGGG 

TACAGGTACTGCTCTAGTTAATCTAAATAACAGGCTGAATTTATTATATG 

GTAGTGTAAGTTG CCITCATTTTT CGAGCGACAAGAATGGTACAAi^ 

TGGTATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAA 

TTCT 

SEQ XD NO. 5810 
STRAIN JM9 1300 13 

TTGATGGTGTTGTTATT CCAAAGG CTAGGAATTATT 
ATGATTTTAGCCTTTTTATTGGTAAATAATAGCT 

TGAAGAG CG GT CTAAACGTGAAACGGTAGT C CTTGT CAT CATTTT CGG CT 
TGTTTGTTATTATAT CTAATATAACAGGAATTGAAATAAAAGGGG AT CGA 
AGTTTGGT CGAG CG CCCTTTTCTAACAACGATTT CTCATTCTG 
TGCTAATAGAAGGACIT'TAGTTATTACAACGGCAA 
CTCTGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTTCATCGCT 
CAAGGAAG CTTTTCAGGTT CTTT CTATATTGT CAGTT CAGTT CTAGT CGG 
(^TTGTTAGCGGAAAGATTGGTGATAAGCrrTAAGGAAAACCATCT CTACC 
(CTTCAACAAG CCAAGCTATTTTAATTAGTATTATT 
ATGCTATTTGTTGGCATTTCTACAGGATGGGAACCT 

CATT C CAATGATGATTTTAAATAGTTTAGGTT CCACALTiTTT'CCTTG CGA 
TTTTGAAAACTTATTTGTCAAATG AAAGT CAGTTACG CG CAGTTCAAACG 
AGAGATGTT CITGAATTGACTCGACAGACT CTGC C CTAC CTTAGACAAGG 
TTTGACACCG CAAT CTG CT AGGAGCGTTTG CGAAATTATAAAGAGGCATA 
CTAACTTTG ATG CTGTGGG ATTAACAG AT CGGTCAAA.CX3T ATTAG CT CAT 
ATT^GTGTTGGC CATGATCACCAT ATTG CAGGACAAC CGGTCAAAACAGA 
CTTATCTAAAAGTGTTATTTTTGATGGCX3AACCAAGAATTG 
AAGC GGCGATTT CTTGTCCAG AT CACAACT GT CAGTTAAATT CTG CTATT 
CTAGTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTA 
CTTTGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTG 
GTTTAGCGCAAATATTTT CAGGACAACTGG CAATG GGGATAACAG AGGAA 
CAAAATAAGTTAGCCAGTATGG CAGAGAT AAAGG CTTTACAAGCACAAAT 
CAACCCTCATTTCTTCTTTAAT 

GTATTGATT CTG ATAAAGCACGTTATG CACTGAT G CAGTTAAGTACTTTT 
TTT AGAACAAGTTTG CAGGGTGGT CAGGATCGTGAGGTAACG CTTGAGCA 
agAAAAAT CACATGTGG ATG CTTATATGAATGTTGAAAAATTACGTTT CC 
CTGATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAG 
TTACCACCTTCTGGTCTACAGGTACTGGTAGAGAATGCAGT^ 
TTTCAAAGAACGTAAGACGGACAACCATATATTGGTTCAAATAAAGCC^^ 
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Table 58: Comparative Sequences relating to SAGO 182 



ATGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCTCA 
GATACTAT CATTGATAAATT AGGT CAAGAAACAGTTG CAG AG AGTAAGGG 
TACAGGTACTG CT CTAGTT AAT CT AAATAACAGG CTGAATTTATTAT ATG 
GTAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTT 
TGGTATCGAATAC CTAATAG AATAAGGGAGGATGAG CATG AAAATTTTAA 
TTCT 

MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa442667 . 2 { * } January 13, 2003 06:34 .. 

1 50 

msa442667 . 2 { 248_18RS21 } TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

maa442667 . 2 { 248_26l)3 } TTGATGGTGT TGTTATTCCA AAGG CT AGGA ATTATTATGA TTTTAGCCTT 

msa442667 . 2 { 248_A909} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa44 2667 .2(24 8_H36B} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

ms a4 42667.2(24 8_JM9 130013} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442 667 .2(24 8_COHl} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442 667 .2(24 8_M78l} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442 667 .2(24 8_M732> TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442667 .2{248_090) TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442667 .2{248_CJB110} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

msa442667.2{248_1169NT} TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT 

Consensus ********** ********** ********** ********** ********** 



51 

msa442667.2{248_18RS2l} TTTATTGGTA AATAATAGTT ATTTtAGACA 

msa442667.2{248_2603} TTTATTGGTA AATAATAGTT ATTTtAGACA 

msa442667.2{248_A909} TTTATTGGTA AATAATAGTT ATTTcAGACA 

msa442 667 .2(248 JK36B} TTTATTGGTA AATAATAGTT ATTTcAGACA 

ms a4 42667-2(24 8_JM9 13 0013} TTTATTGGTA AATAATAGTT ATTTcAGACA 

msa442667 .2(248_COHlj TTTATTGGTA AATAATAGTT ATTTcAGACA 

msa442667.2{248_M781} TTTATTGGTA AATAATAGTT, ATTTcAGACA 

msa442667.2(248_M732} TTTATTGGTA AATAATAGTT' ATTTcAGACA 

msa442667.2{248_090) TTTATTGGTA AATAATAGTT ATTTcAGACA 

msa442667 . 2 (248_CJB110 } TTTATTGGTA AATAATAGTT ATTTcAGACA 

msa442 667 .2(24 8_1169NT} TTTATTGGTA AATAATAGTT ATTTcAGACA 

Consensus ********** ********** ****_***** 



GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 
GTTAATTGAA 



100 

GAG CGGTCTA 
GAGCGGTCTA 
GAG CGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 
GAGCGGTCTA 



********** ********** 



101 150 

msa442667.2(248_18RS2l} AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 .2(24 8_2 603} AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

tnsa442667 .2( 248_A909) AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 .2{248_H36B) AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 .2(24 8_JM9 13 0013} AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 .2{248_COHl} AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667.2{248_M78l} AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 . 2{248__M732 j AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667.2{248_090} AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667.2{248_CJB110} AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

msa442667 . 2 ( 248_1169NT) AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA 

Consensus ********** ******_*** ********** ********** ********** 

151 200 

msa4426S7 . 2 ( 248_18RS21) T CT AAT AT AA CAGGAATTGA AATAAAAGGG GAT CGAAGTT TGGTCGAGCG 

msa442667.2{248_2603} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248_A909} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248_H36B} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442 667 .2 (248_JM9 130013} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248_COHl} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248_M78l} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667 .2(248_M732} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2{248_090} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248__CJB110} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

msa442667.2(248_1169NT} TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG 

Consensus ********** ********** ********** ********** ********** 



201 

msa442667 .2 (248_18RS2l} CCCTTTTCTA ACAACGATTT 

msa442667 .2{248_2603) CCCTTTTCTA ACAACGATTT 

msa442667.2(248_A909} CCCTTTTCTA ACAACGATTT 

msa442667 .2{248_H36B} CCCTTTTCTA ACAACGATTT 

msa442667 .2{248_JM9130013) CCCTTTTCTA ACAACGATTT 

msa442667.2f248_C0Hl> CCCT TTT CTA ACAACG ATTT 

msa442667.2(248_M78l| CCCTTTTCTA ACAACGATTT 

msa442667 .2(248_M732 } CCCTTTTCTA ACAACGATTT 

msa4 42667. 2(24 8_0 9 0 } CCCTTTTCTA ACAACGATTT 

msa442667 . 2 ( 248_CJB110 J CCCTTTTCTA ACAACGATTT 

msa442667 .2 (248_1169NT) CCCTTTTCTA ACAACGATTT 

Consensus ********** ********** 



CtCATTCTGA 
CtCATTCTGA 
CtCATTCTGA 
CtCATTCTGA 
CtCATTCTGA 
CcCATTCTGA 
CcCATTCTGA 
CcCATTCTGA 
CcCATTCTGA 
CcCATTCTGA 
CtCATTCTGA 



CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 
CTCACTTGCT 



250 

AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 
AATACAAGGA 



*_******** ********** ********** 



msa442667.2(248_18RS2l} 
msa442667 . 2 { 248J2603 } 



251 300 
CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 
CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 
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Table 58: Comparative Sequences relating to SAG0182 



msa442667 .2{248_A909} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 ,2{248_H36B} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 ,2{248_JM9130013} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 .2 {248_COHl} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 .2 {248_M78l} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 .2{248_M732} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667. 2 {248_090} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667 .2{248_CJB110} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

msa442667.2{248_1169NT} CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA 

Consensus ********** ********** ********** ********** ********** 



msa442667 .2{ 
msa442667 
msa442667 
msa442667 
msa442667.2{248 
msa442667 . 
msa442667 . 
msa442667 . 
msa442667 
msa442667.2 
msa442667 .2 



248_18RS2l] 
2{248_2603] 
2{248_A909] 
2{248_H36B 
JM9130013. 
2*{248_COHl} 
2{248_M781} 
2{248_M732} 
.2{248_090} 
248_CJBli0} 
248_JL169NT} 
Consensus 



301 

ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
ATTGTTGGTT 
********** 



TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
TTATTGGAGG 
********** 



AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
AGTTCATCGC 
********** 



TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
TTTTTTCAAG 
********** 



350 

GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
GAAGCTTTTC 
********** 



msa442667.2{ 
msa442667 . 
msa442667 . 
msa442667 . 
msa442667.2{248 
msa442667 
msa442667 
msa442667 
msa442667 
tnsa442667 .2 
msa442667 .2 



248_18RS2l} 
2 { 248^2603} 
2{248_A909} 
2{248_H36B} 
_JM9130013' 
2{248_COHl' 
2{248_M781 ( 
2{248_M732; 
.2{248_090" 
248_CJB110' 
248_1169NT) 
Consensus 



351 

AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
AGGTTCTTTC 
********** 



TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
TATATTGTCA 
********** 



GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
GTTCAGTTCT 
********** 



AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
AGTCGGCATT 
********** 



400 

GT t AGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTtAGCGGAA 
GTgAGCGGAA 
**_******* 



msa442667 . 2 { 248_1BRS21 } 
msa442667.2{248_2603} 
msa442667 . 2 f 248_A909> 
msa442667 . 2 { 248_H36B} 
msa442667.2{248_«JM9130013} 
msa442667 . 2 { 248_C0H1 } 
ttisa44 2 6 6 7 . 2 j 24 8__M7 8 1 \ 
msa442667.2{248 M732) 
msa442667 .2{248_090} 
msa442667 .2{248_CJB110} 
msa442667 . 2 { 248_1169NT} 
Consensus 



401 

AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 
AGATTGGTGA 



TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 
TAAGCTTAAG 



GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 
GAAAACCATC 



TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 
TCTACCCTTC 



********** ********** ********** ********** 



450 

AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
AACAAGCCAA 
********** 



msa442667 .2{ 

msa442667. 

msa442667 . 

msa442667 . 
msa442667.2{248 

msa442667 . 

ltisa442667 . 

msa442667 . 
msa442667 
rasa442667 ,2{ 
msa442667.2{ 



248_18RS21} 
2{248__2603} 
2{248_A909} 
2{248_H3SB} 
_JM9130013> 
2{248_COHl> 
2{248_M781} 
2{248JM732} 
.2{248_090} 
248_CJB110j 
248_1169NT} 
Consensus 



451 

GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
GTTATTTTAA 
********** 



TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
TTAGTATTAT 
********** 



TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
TGCCGAAAGT 
********** 



ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 
ATCCAGATGC 



500 

TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 
TATTTGTTGG 



********** ********** 



msa442667.2{ 

msa442667 . 

msa442667 . 

msa442667 . 
msa442667.2{248 

msa442667 . 

msa442667 . 

msa442667 . 
msa442667 
msa442667 
rasa442667 



248_18RS2l] 
2(248__2603| 
2{248_A909j 
2{248_H36B] 
JM9130013] 
2{248_COHl} 
2{248_M78l} 
2{248__M732} 
Jt ,2{248_090} 
.2(248_CJB110} 
.2{248_1169NT> 
Consensus 



501 

CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
CATTTTTACA 
tATTTTTACA 
tATTTTTACA 
CATTTTTACA 
_********* 



GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
GGATGGGAAC 
********** 



TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
TTGTCAAAAT 
********** 



GATTGTCATT 
GATTGT CATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
GATTGTCATT 
********** 



550 

CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
CCAATGATGA 
********** 



msa442667 . 2 { 248_18RS2l} 



551 600 
TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT 



909 
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Table 58: Comparative Sequences relating to SAG0182 



msa442667 .2{248_2603} 
msa442667.2{248_A909} 
msa442667 .2{248_H36B} 
ms a4 4 2 6 67 . 2 { 24 8_JM9 13 0 0 1 3 } 
msa442667 .2(248_COHl} 
msa442667.2{248_M78l} 
msa442667.2{248_M732} 
msa442667 . 2 {248_090 } 
msa442667 . 2 { 248_CJB110 } 
msa442667 . 2 { 248_1169NT} 
Consensus 



TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 
TTTTAAATAG 



TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 
TTTAGGTTCC 



ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 
ACACTTTTCC 



********** ********** ********** 



TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
TTGCGATTTT 
********** 



GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
GAAAACTTAT 
********** 



msa442667 .2{248_18RS21} 
msa442667 .2{248_2603} 
msa442667.2( 248_A909} 
msa442667 . 2 (248 JH36B} 
msa442667.2{248_JM9130013} 
msa442667 . 2 { 248_COHl } 
tnsa442667.2i248_M78l} 
msa44 2 6 6 7 . 2 { 24 8_M7 3 2 } 
msa442667.2{248_090} 
msa442667 . 2 {248_CJB110 } 
msa442667 . 2 {248_1169NT} 
Consensus 



601 

TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 
TTGTCAAATG 



AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 
AAAGTCAGTT 



ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 
ACGCGCAGTT 



********** ********** ********** 



CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
CAAACGAGAG 
********** 



650 

ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
ATGTTCTTGA 
********** 



651 

msa442 667 .2(24 8_18RS21) ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667 . 2{248_2603 } ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667.2{24 8_A909} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667 .2{248_H36B} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667.2{248_JM9130013} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667.2{248_COHl} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667 .2{248_M78l} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667 .2{248_M732} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

msa442667 .2{248_090} ATTGACTCGA CAGACTCTGC CCTACCTcAG ACAAGGTTTG 

msa442667 . 2 { 248_CJB110 } ATTGACTCGA CAGACTCTGC CCTACCTcAG ACAAGGTTTG 

tnsa442667 .2{248_1169NT} ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG 

Consensus ********** ********** *******_** ********** 



700 

ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
ACACCGCAAT 
********** 



msa442667.2{248_18RS2l} 
msa442667.2(248_2603) 
msa442667.2{248_A909) 
msa442667 .2{248_H36B} 
msa442667.2{248_JM9130013} 
msa442667 .2{248_COHl} 
msa442667 .2{248_M781} 
msa442667 . 2 { 248_M732 } 
msa442667.2{248_090} 
msa442667 . 2 f 248_CJB110 } 
msa442667 . 2 { 248_1169NT} 
Consensus 



msa442667 . 2 { 248_18RS2l} 
msa442667.2{248__2603} 
msa442667.2{248_A909} 
msa442667.2{248_H36B) 
msa442667 .2{248_JM9 130013} 
msa442667 . 2 { 248_COHl } 
msa442667 .2{248_M781} 
msa442667 . 2 { 24 8_M732 } 
msa442667 . 2 { 248_090} 
msa442667 . 2 { 248_CJB110 } 
msa442667 . 2 { 248_1169OT} 
Consensus 



701 

CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
CTGCTAGGAG 
********** 

751 

GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTgGGATTAA 
GTaGGATTAA 
GTaGGATTAA 
GTgGGATTAA 
**„******* ********** 



CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
CGTTTGCGAA 
********** 



CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 
CAGATCGGTC 



ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
ATTATAAAGA 
********** 



AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
AAACGTATTA 
********** 



750 

cTTTGATGCT 
CTTTGATGCT 
cTTTGATGCT 
cTTTGATGCT 
cTTTGATGCT 
CTTTGATGCT 
cTTTGATGCT 
CTTTGATGCT 
CTTTGATGCT 
cTTTGATGCT 
tTTTGATGCT 
.********* 

800 

GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTaTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
GTgTTGGCCA 
********** **_******* 



GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
GGCATACTAA 
********** 



GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 
GCTCATATTG 



801 

msa442667 . 2 { 248_18RS2l) TGATCACCAT 

msa442667 .2{248_2603) TGATCACCAT 

msa442667.2(248_A909} TGATCACCAT 

tnsa442 667 .2(24 8_H36B} TGATCACCAT 

msa442667.2{248_JM9130013 j TGATCACCAT 

ms a4 42667.2(24 8_C0H1 } TGATCACCAT 

msa442667 .2{248_M78l} TGATCACCAT 

tnsa442667.2{24 8_M7 3 2 } TGATCACCAT 

msa442667.2{248_090} TGATCACCAT 

msa442667 . 2 { 248_CJB110 } TGATCACCAT 

msa442667.2{248_1169NT} TGATCACCAT 

Consensus ********** 



ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 
ATTGCAGGAC 



AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCgGTCAA 
AACCaGTCAA 
AACC aGTCAA 
AACCaGTCAA 



AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACtTA 
AACAGACcTA 
AACAGACcTA 
AACAGACcTA 



850 

TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 
TCTAAAAGTG 



********** ****-***** *******-** ********** 



851 



900 



910 



WO 2004/018646 



PCT/US2003/026827 



Table 58: Comparative Sequences relating to SAG0182 



msa442667 .2 {248_18RS2l} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa4 42667.2(24 8_2 603} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa4 4 2 6 6 7 . 2 { 2 4 8_A9 0 9 } TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa4 42667. 2(24 8_H3 6B } TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa442667 .2(248 JM913 0013} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa4 42667 .2(24 8_COHl} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa4 42667. 2(248 M7 8 1 } TTATTTTTGA TGGCGAACCA AGAATTGCGC 

Tnsa4 42667. 2 { 2 4 8_M7 3 2 } TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa442667 .2{248__090} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

rasa442 667 . 2 { 24 8_CJB1 1 0 } TTATTTTTGA TGGCGAACCA AGAATTGCGC 

msa442667 .2(248_1169NT} TTATTTTTGA TGGCGAACCA AGAATTGCGC 

Consensus ********** ********** ********** 



AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 
AAGATAAAGC 



GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 
GGCGATTTCT 



********** ********** 



msa442667.2( 

msa442667 . 

tnsa442667 . 

msa442667 . 
msa442667.2{248 

msa442667 . 

msa442667 . 

rasa442667 . 
msa442667 
msa442667 
msa442667 



;. 2 j 

f.2{ 



248_JL8RS2l} 
2{248__2603> 
2{24 8_A909} 
2{248_H36B} 
_JM9130013} 
2{248_C0H1} 
2(248_M78l} 
2{2'48_M732} 
.2(248_090} 
248__CJB110) 
248_1169NT} 
Consensus 



901 

TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
TGTCCAGATC 
********** 



ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
ACAACTGTCA 
********** 



GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
GTTAAATTCT 
********** 



GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
GCTATTGTAG 
********** 



950 

TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
TTCCTCTAAA 
********** 



msa442667 . 2 ( 248_18RS21 } 
msa442667 .2(248_2603} 
msa442667 . 2 ( 248_A909 } 
msa442667 .2{248_H36B} 
msa442667.2{248_JM9130013} 
msa442667 . 2 { 248_COHl } 
msa442667.2(248_M78l} 
msa442667.2(248_M732} 
msa442667 . 2 (248_090 } 
msa442667.2(248__CJB110} 
msa442667.2{248_1169NT} 
Consensus 



951 

AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 
AATAAATGAT 



AAAACTGTGg 
AAAACTGTGg 
AAAACTGTGg 
AAAACTGTGg 
AAAACTGTGg 
AAAACTGTGt 
AAAACTGTGt 
AAAACTGTGt 
AAAACTGTGg 
AAAACTGTGg 
AAAACTGTGg 



GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 
GTGCCTTAAA 



AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 
AATGTACTTT 



********** *********_ 



********** ********** 



1000 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
GCAGGAGATA 
********** 



msa442667.2{ 
msa442667 . 
msa442667 . 
msa442667 
msa442667. 2(248, 
msa442667 
msa442667 . 
msa442667 . 
msa442667 
msa442667.2 
msa442667 .2 



248_18RS21> 
2(248_2603} 
2{248_A909} 
2{248_H36B} 
_JM9130013} 
2{248_C0H1} 
2(248_M78l} 
2(248_M732} 
.2(248_090} 
248__CJB110} 
248_1169NT} 
Consensus 



1001 

AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 
AGACAATGTC 



TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 
TGAGGTGGAG 



********** ********** 



GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
GAAAACCTAG 
********** 



TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 
TCCTTGGTTT 



1050 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 
AGCGCAAATA 



********** ********** 



1051 1100 

msa442667.2(248__18RS2l} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667 .2(24 8_2 603} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2{24 8_A9 0 9 } TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2(248_H36B} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2(248_JM9130013} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442 667 .2(24 8_COHlj TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa4 42667. 2(24 8__M7 8 1 } TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2(248_M732} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2(248_090} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

ms a4 42667.2(24 8_CJB 110} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

msa442667.2(248_1169NT} TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC 

Consensus ********** ********** ********** ********** ********** 



msa442667 .2 (248_18RS2i; 
msa442667 . 2 { 248_2603 ] 
msa442667 . 2 ( 248_A909 ] 
msa442667 .2{248_H36B; 
msa442667 . 2 { 248_JM9130013 
msa442667 . 2 { 248_COHl 
msa442667 .2{248_M781 
rasa442667 .2(248_M732 
msa442667 . 2 (248_090 
msa442667 . 2 { 248_CJB110 
m sa442667 . 2 { 248_1169NT 
Consensus 



1101 

CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
CAGTATGGCA 
********** 



GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
GAGATAAAGG 
********** 



CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
CTTTACAAGC 
********** 



ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
ACAAATCAAC 
********** 



1150 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTcT 
CCTCATTTtT 
CCTCATTTcT 
********_* 



911 



WO 2004/018646 



PCT/US2003/026827 



Table 58: Comparative Sequences relating to SAG0182 



msa442667.2{ 

msa442667 . 

msa442667 . 

msa442667 . 
msa442667.2{248 

msa442667 . 

msa442667 . 

msa442667 . 
msa442667 
msa442667 .2f 
msa442667 .2{ 



248_18RS2l} 
2{248_2603} 
2{248_A909} 
2{248_H36Bj 
_JM9130013} 
2{248_COHl} 
2{248_M78l} 
2{248_M732} 
.2{248_090} 
248_CJB110} 
248_1169NT} 
Consensus 



1151 

TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 
TCTTTAATGC 



CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 
CATTAACACA 



ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 
ATTAGTGCAT 



********** ********** ********** 



TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
TAATCCGTAT 
********** 



1200 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
TGATTCTGAT 
********** 



1201 

msa442667.2{248_18RS2l} AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667.2{248_2603) AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667 .2{248_A909) AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667 . 2 { 248__H36B> AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667 .2{248_JM9130013) AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

rnsa442667.2{248_COHl> AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667 ,2{248_M78l} AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667.2{248_M732} AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667.2{248_090> AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442667 .2{248_CJB110) AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

msa442 667 .2(24 8_1169NT} AAAGCACGTT ATGCACTGAT GCAGTTAAGT 

Consensus ********** ********** ********** 



ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
ACTTTTTTTA 
********** 



1250 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
GAACAAGTTT 
********** 



msa442667 . 2 { 248_18RS21 } 
msa442667 .2{248_2603} 
msa442667.2{248_A909} 
msa442667.2{248 H36BJ 
msa442667 .2 {248_JM9 130013} 
msa442667 . 2 { 248_C0H1 } 
msa442667 .2{248_M78ll 
msa442667 . 2 { 248_M732 } 
msa442667 . 2 { 248_090 } 
msa442667 . 2 {248_CJB110 } 
msa442667 . 2 { 248_1169NT) 
Consensus 



1251 

GCAgGGTGGT 
GCAgGGTGGT 
GCAgGGTGGT 
GCAgGGTGGT 
GCAgGGTGGT 
GCAaGGTGGT 
GCAaGGTGGT 
GCAaGGTGGT 
GCAaGGTGGT 
GCAaGGTGGT 
GCAaGGTGGT 



CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 
CAGGATCGTG 



AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 
AGGTAACGCT 



TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 
TGAGCAAGAA 



***_****** ********** ********** ********** 



1300 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
AAATCACATG 
********** 



1301 

msa442 66 7. 2 { 24 8_1 8 RS2l} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa44 26 67. 2 { 248^2603} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{248_A909} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{248_H36B} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{248_JM9130013} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667 .2(248_C0H1> TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa4426 67 .2(248 _M78l} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{24S_M732> TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{248_090} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667.2{248_CJB110} TGGATGCTTA TATGAATGTT GAAAAATTAC 

msa442667 . 2 { 248_1169NT} TGGATGCTTA TATGAATGTT GAAAAATTAC 

Consensus ********** ********** ********** 



msa442667 . 2 { 248_18RS21 } 
msa442667 .2f248_2603] 
msa442667 . 2 ( 248__A909 ] 
msa442667 .2{248_H36B] 
msa442667 .2 {248_JM9130013 ] 
msa442667 . 2 { 248__C0H1 ] 
msa442667 .2{248_M781 
msa442667 . 2 {248_M732 \ 
msa442667 . 2 {248_090 \ 
msa442667 . 2 {248_CJB110 } 
msa442667 . 2 { 248_1169NT} 
Consensus 



msa442667 . 2 { 

msa442667 . 

msa442667. 

msa442667 . 
msa442667.2{248 

msa442667 . 

msa442667 . 

msa442667. 
msa442667 
msa442667.2{ 
msa442667.2{ 



248_18RS2l} 
2{248_2603} 
2{248_A909} 
2{248_H36B) 
_jJM9130013} 
2{248_COHl} 
2(248_M78l} 
2{248__M732} 
.2{248_090} 
248_CJB110} 
248_1169NT} 
Consensus 



1351 

TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 
TTATCTTATG 



ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 
ATATTAGTGC 



********** ********** 



ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
ACCAGAAAAA 
********** 



GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
GTTTCCCTGA 
********** 



ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
ATGAAGTTAC 
********** 



1350 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
TAAATATCAG 
********** 

1400 
CaCCTTTTGG 
CaCCTTTTGG 
CaCCTTTTGG 
CaCCTTTTGG 
CaCCTTTTGG 
CgCCTTTTGG 
CgCCTTTTGG 
CgCCTTTTGG 
CgCCTTTTGG 
CgCCTTTTGG 
CgCCTTTTGG 
*_******** 



1401 

TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 
TTTACAGGTA 



CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 
CTGGTAGAGA 



********** ********** 



ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTcG 
ATGCAGTTaG 
ATGCAGTTaG 
ATGCAGTTcG 
********_* 



ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTc 
ACATGCTTTt 
*********„ 



1450 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
AAAGAACGTA 
********** 
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1451 

msa442667 .2{248_JL8RS2l} AGACGGACAA 

msa442667 .2{248_2603} AGACGGACAA 

msa442667.2f 248_A909> AGACGGACAA 

msa442667 .2{248_H36B) AGACGGACAA 

msa442667 . 2 { 248_JM9130013 } AGACGGACAA 

msa4 42667.2(24 8_COHl } AGACGGACAA 

msa442667 .2(248_M78l) AGACGGACAA 

msa442667.2{248 1 _M732} AGACGGACAA 

msa442667.2{248_090} AGACGGACAA 

msa442667 .2{248_CJB110} AGACGGACAA 

Ttisa442667 .2{248_1169NT} AGACGGACAA 
Consensus 



CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
********** ********** 



GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
********** 



AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
********** 



msa442667 . 2 { 248_18RS21 } 
msa442667 . 2{248J2603 } 
msa442667 . 2 { 248_A909 ] 
msa442667 .2{248_H36Bj 
msa442667.2{248_JM9130013 j 
rasa442667.2{248_C0Hl] 
nisa442667.2{248_M781] 
msa442667 . 2 { 248JM732 ; 
msa442667 . 2 {248_090 ] 
msa442 667 . 2 { 248_CJB110 } 
msa442667 - 2 { 248_1169NT} 
Consensus 



msa442667.2{248_18RS2i; 
msa442667 . 2 (248_2603 ; 
msa442667 . 2 { 248_A909". 
msa442667 ,2{248_H36B] 
msa442667 .2{248_JM9130013] 
msa442667.2{248_C0Hl) 
msa442667.2{248_M781} 
msa442667 . 2 {248JM732 } 
msa442667 .2{248_090} 
msa442667 . 2 { 248_CJB110 } 
msa442667.2{248_1169NT} 
Consensus 



msa442667.2{248_18RS2l} 
msa442667 . 2 { 248J2603 } 
msa442667.2{248_A909) 
msa442667.2{248_H36B} 
msa442667 .2{248_JM9130013} 
msa442667 . 2 { 248__C0H1 } 
msa442667.2{248_M78l! 
msa442667 . 2 { 248_M732 j 
msa442667 . 2 { 248_090 \ 
msa442667.2{248_CJB110; 
msa442667.2{248_1169NT] 
Consensus 



1501 

TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
.TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
TGTGTTTCTG 
********** 

1551 

TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
TAAATTAGGT 
********** 

1601 

TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
TAGTTAATCT 
********** 



TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
TTAGTGACAA 
********** 



CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
********** 



AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
AAATAACAGG 
********** 



TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
TGGACAAGGA 
********** 



TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
TTGCAGAGAG 
********** 



ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
ATCTCAGATA 
********** 



TAAGGG t ACA 
TAAGGGtACA 
TAAGGG tACA 
TAAGGGtACA 
TAAGGGtACA 
TAAGGGgACA 
TAAGGGgACA 
TAAGGGgACA 
TAAGGGtACA 
TAAGGGtACA 
TAAGGGtACA 
******_*** 



CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
CTGAATTTAT 
********** 



TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
TATATGGTAG 
********** 



1500 
T CATT ATT AT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
********** 

1550 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
CTATCATTGA 
********** 

1600 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
********** 

1650 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
TGTAAGTTGC 
********** 



msa442667 . 2 { 248_18RS21 ) 
msa442667 . 2 { 248J2603 } 
msa442667.2{248_A909} 
msa442667 . 2 { 248JH36B} 
msa442667.2{248_JM9130013} 
msa442667 . 2 { 248_C0H1 } 
msa442667.2(248_M78l} 
msa442667.2{248_M732) 
msa442667.2{248_090) 
msa442667 . 2 { 248_CJB110 } 
msa442667 . 2 { 248_1169NT} 
Consensus 



1651 

CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CITCATTTTT 
********** 



CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
********** 



GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
********** 



AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 
AAAGTTTGGT 



1700 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 
ATCGAATACC 



********** ********** 



1701 

msa442667 .2{248_18RS2l} TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667.2(248_2603j TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667.2(248_A909} TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667.2{248_H36B} TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667.2{248_JM9130013} TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442 66 7. 2 ( 24 8_C0H1) TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667 .2{248_M78l) TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa442667.2{248_M732} TAATAGAATA AGGGAGGATG AGCATGAAAA 

msa4 4 266 7. 2 {248_090} TAATAGAATA AGGGAGGATG AGCATGAAAA 

ms a442667.2{248_CJB110} TAATAGAATA AGGGAGGATG AGCATGAAAA 

m0 a442667.2{248_1169NT} TAATAGAATA AGGGAGGATG AGCATGAAAA 



1740 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
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Consensus ********** ********** ********** ********** 



SEQ ID NO. 5811 
STRAIN 2603 frame: 1 

LMVLLFQRLGI IMILAPLLVNNSYFRQLIEERSKRETVVLVI IFGLFVI I SNITGIEIKG 
DRSLVERPFLTT I SHSDSLANTRTLVI TTASLVGGPLVGS I VGF I GG VHR FFQGS FSGS F 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI S 1 1 AES IQMLFVGI FTGWELVKMI VI 
PMMI LNSLGSTLFTAILKT YLSNE S QLRAVQTRDVLELTRQTLP YLRQGLTPQS ARSVCE 
1 1 KRHTNFDAVGLTDRSNVIiAH I GVGHDHH I AGQPVKTDLSKSVI FDGE PRI AQDKAAI S 
CPDHNCQLNSAI WPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQ I FSGQLAMGI T 
EEQNiOASMAEIKALQAQINPHFFFmrNTXSALrRIDSDKARYALMQLSTFFRTSIiQGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I S APEKMKLP PFGLQ VLVENAVRHAF 
KERKTDNHI LVQI KPDGHYYCVSVSDNGQG I SDT I IDKLGQETVAESKGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS 

SEQ ID NO. 5812 

STRAIN 090 frame: 1 

LMVLiIiFQRLG I IMILAFLLVNNSYFRQLIEERSKRETVVLVI IFGLFVI ISNITGIEIKG 
DRSLVERPFLTT I SHSDSLANTRTLVITTASLVGGPLVGS I VGFIGGVHRFFQGS FSGSF 
Y I VSSVLVG I VSGKIGDKLKENHL YPSTSQVI L I S I 1 AES I QMLFVG I FTGWELVKMI VI 
PMMILNSIXSSTLFLAILKTYLSNESQLIUVVQTRDVLELTRQTLPYLRQGLTPQSARSVeE 
1 1 KRHTNFDAVGLTDRSNVIiAH I GVGHDHH I AGQPVKTDLSKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQ I FSGQLAMGIT 
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG 
QDRE VTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPEKMKLPPFGLQVLVENAVRHAF 
KERKTDNHI LiVQI KPDGHYYCVSVSDNGQG I SDT 1 I DKLGQETVAE SKGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRI PNR I REDEHENFNS 

SEQ ID NO. 5813 
STRAIN A909 frame: 1 

LMVLiLFQRLG I IM I LAFLLVNNS YFRQL I EERSKRETWLVI I FGLFVI I SNITG I E I KG 
DRSLVERPFLTT I SHSD SLANTRTL VI TTASLVGGPLVGS I VGFIGGVHRFFQGS FSGSF 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI S I IAES IQMLFVGI FTGWELVKMIVI 
PMMI LNSLGSTLFIiAI LKTYLSNESQLRAVQTRD VLELTRQT'LP YLRQGLTPQSARS VCE 
1 1 KRHTNFDAVGLTDRSNVLAHIGVGHDHHI AGQPVKTDLSKSVI FDGEPRI AQDKAAI S 
CPDHNCQLNSAI WPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQI FSGQLAMGIT 
EEQNKIASMAEIKALOJ^QINPHFFFNAINTISALIRIDSDKARYAIjMQLSTFFRTSLGGG 
QDRE VTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPEKMKLPPFGLQVLVENAVRHAF 
KERKTDNHI LVQ I KPDGHYYCVSVSDNGQG I SDTI I DKLGQETVAE SKGTGTALVNLNNR 

LNLLYGSVSCLHFSSDKNGTKVWYRIPNRI REDEHENFNS l 

SEQ ID NO. 5814 

STRAIN H36B frame: 1 

LMVLLFQRLG I IM I LAFLLVNNS YFRQL I EERS KRET WLVI IFGLFVI ISNITGIEIKG 
DRSLVERPFLTT I SHSDSLANTRTLVI TTASLVGGPLVGS I VGFIGGVHRFFQGS FSGSF 
YI VSSVLVGI VSGKIGDKLKENHL YPSTSQVILISI IAES IQMLFVGIFTGWELVKMIVI 
PMMI LNSLGSTLFLAI LKTYLSNE S QLRAVQTRDVLELTRQTLP YLRQGLTPQS ARSVCE 
1 1 KRHTNFDAVGLTDRSNVLAHIGVGHDHHI AGQPVKTDLSKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNSAI WPLKI NDKTVGALKMYFAGDKTMSEVEENLVLGLAQ I FSGQLAMG I T 
EEQNKLASMAEIKALQAQINPHFFFNAIOTISALIRIDSDKARYALMQLSTFFRTSLQGG 
QDRE VTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPEKMKLPPFGLQVLVENAVRHAF 
KERKTDNH I LVQ I KPDGHYYCVSVSDNGQG I SDTI I DKLGQETVAE SKGTGTALVNLNNR 
LNLL YGSVS CLHFS SDKNGTKVWYRI PNRI REDEHENFNS 

SEQ ID NO. 5815 
STRAIN 18RS21 frame: 1 

LMVLLFQRLG 1 1 MI LAFLLVNNS YFRQL I EERSKRETWLVI I FGLFVI I SNITG IE I KG 
DRSLVERPFLTTI SHSDSLANTRTLVI TTASLVGGPLVGS I VGFIGGVHRFFQGS FSGS F 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI SI I AESIQMLFVGI FTGWELVKMI VI 
PMMIIJ^SI/3STLFLAILKTYLSNESQLRAVGTRDVLELTRQTLPYLRQGLTPQSARSVCE 
1 1 KRHTNFDAVGLTDRSNVLAH I GVGHDHH I AGQPVKTDLS KS VI FDGEPRIAQDKAAI S 
CPDHNCQLNSAI VVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQI FSGQLAMGI T 
EEQNKIASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG 
QD RE VTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPEKM^ 

KERKTDNH I LVQ I KPDGHYYCVSVSDNGQG I SDT 1 1 DKLGQETVAE SKGTGTALVNLNNR 
LNLL YGSVS CLHFS SDKNGTKVWYR I PNR I REDEHENFNS 

SEQ ID NO. 5816 

STRAIN M732 frame: 1 

LMVLLFQRLG 1 1 MIIAFIJjVNNSYFRQLIEERSKRETVVLVI IFGLFVI ISNITGIEIKG 
DRSLVERPFLTT I SHSDSLANTRTLVI TTASLVGGPLVGS I VGFIGGVHRFFQGS FSGSF 
Y I VS S VLVG I VSGKI GDKLKENHL YPSTSQVI L I S I IAES I QMLFVG I FTGWELVKM I V I 
PMM I LNSLGSTLFLAI LKTYLSNESQLRAVQTRDVLELTRC/TLPYLRQGLTPQSARS VCE 
1 1 KRHTNFDAVGLTDRSNVLAHIGIGHDHHI AGQPVKTDLSKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNSAI WPLKINDKTVCALKMYFAGDKTMSEVEENLVLGLAQ I FSGQLAMG I T 
EEQNKLASMAEI KALQAQI NPHFFFNAI NT I SAL I RI DSDKARYALMQLSTFFRTSLQGG 
QDRE VTLEQE KSHVDAYMNVEKLRFPDKYQLS YD I SAPE KMKLPP FGLQVLVENAVRHAF 
KERKTDNHI LVQI KPDGHYYCVSVSDNGQG I SDT I IDKLGQETVAES KGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS 

SEQ ID NO. 5817 
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STRAIN COH1 frame: 1 

LM VLLFQRLG 1 1 M I LAFLLVNNS YFRQL I EERS KRETWL V 1 1 FGLF V 1 1 SN I TG I E I KG 
DRSLVERPFLTT I SHSDSLANTRTLVI TTASLVGGPLVGS I VGF I GGVHRFFQGS FSGS F 
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI 
PMMI LNSLGSTLFLAI LKTYLSNESQLRAVQTRDVLELTRQT'LPYLRQGLTPQSARSVCE 
1 1 KRHTN FDAVG LTDRS NVLAH I G VGHDHH I AGQPVKTDL SKSVI FDGE PRI AQDKAAI S 
CPDHNCQLNSAIWPLKINDKTVCALKMYFA 

EEQNK1ASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLGGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPE KMKLPP FGLQ VLVENAVRHAF 
KERKTDNHI LVQ I KPDGHYYCVSVSDNGQG I SDT 1 1 DKLGQETVAE S KGTGTAL VNLNNR 
LNLLYGSVSCLHFS SDKNGTKVWYRI PNRI REDEHENFNS 

SEQ ID NO, 5818 
STRAIN M781 frame: 1 

LMVLLFQRLGI IMII^AFLLVNNSYFRQLIEERSKRETWLVI IFGLFVI ISNITGIEIKG 
DRSLVERPFLTT I SHSDSLANTRTLVI TTASLVGGPLVGS I VGF I GGVHRFFQGS FSGS F 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI SI IAES IQMLFVGI FTGWELVKMI VI 
PMMI LNSLGSTLFLAI LKT YLSNE S QLRAVQTRD VLELTRQTLP YLRQGLTPQSARS VCE 
1 1 KRHTNFDAVGLTDRSNVLAH I G VGHDHH I AGQPVKTDL SKSVI FDGEPRI AQDKAAI S 
CPDHNCQLNSAIVVPLKINDKTVCALKMYFAGDKTMSEVEENLVIiGL^ 
EEQNKLASMAEI KALQAQINPHFFFNAINTI SAL I RI DSDKARYALMQLSTFFRTSLQGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRH^ 
KERKTDNH I LVQ I KPDGHYYCV S VSDNGQG I SDT 1 I DKLGQETVAES KGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRI PNRI REDEHENFNS 

SEQ ID NO. 5819 
STRAIN CJB 1 10 frame: 1 

LMVLLFQRLGI IMI LAFLLVNNSYFRQLIEERSKRETVVLVI I FGLFVI I SNITGIEIKG 
DRSL VERPFLTTISHSDSLANTRTLVI TTASLVGGPLVGS IVGFIGGVHRFFQGSFSGSF 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI SI IAES IQMLFVGI FTGWELVKMI VI 
PMMI LNSLGSTLFLAI LKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE 
I IKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNS AI VVPLKI NDKTVGALKMYFAGDKTMS EVEENLVLGLAQ I FSGQLAMG I T 
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPEKMKLP PFGLQVLVENAVRHAF 
KERKTDNH I LVQ I KPDGHYYCVSVSDNGQG I SDT 1 1 DKLGQETVAE S KGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRI PNRI REDEHENFNS 

SEQ ID NO. 5820 
STRAIN 1 169NT frame: 1 

LMVLLFQRLG 1 1 M I LAFLLVNNS YFRQL I EERSKRETWLVI I FGLFVI I SN I TG I E I KG 
DRSLVERPFLTTI SHSDSLANTRTLVI TTASLVGG PLVGS IVGFIGGVHRFFQGSFSGSF 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI S I IAES IQMLFVGI FTGWELVKM I VI 
PMM I LNSI/3STLFLA I LKT YLSNES QLRAVQTRD VLELTRQTLP YLRQGLTPQSARS VCE 
1 1 KRHTNFDAVGLTDRSNVLAH I GVGHDHHI AGQPVKTDL SKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNSAI VVPLKINDKTVGALKMYFAGDKTMS EVEENLVLGLAQI FSGQLAMG IT 
EEQNKIASMAEI KALQAQINPHFFFNAINTI SALIRIDSDKARYALMQLSTFFRTSLQGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLS YD I SAPE KMKLPP FGLQ VLVENAVRHAF 
KERKTDNHI LVQ I KPDGHYYCVSVSDNGQG I SDT 1 1 DKLGQETVAESKGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRI PNRI REDEHENFNS 

SEQ ID NO. 5821 
STRAIN JM9130013 frame: 1 

LMVLLFQRLGI IMI LAFLLVNNSYFRQLIEERSKRETVVLVI I FGLFVI ISNITGIEIKG 
DRSLVERPFLTTI SHSDSLANTRTLVI TTASLVGGPLVGS IVGFIGGVHRFFQGSFSGSF 
YI VSSVLVGI VSGKIGDKLKENHLYPSTSQVILI S I IAESIQMLFVGIFTGWELVKMIVI 
PMMILNSIjGSTLFLAILKTYLSNESQLRAVGTRDVLELTRQTLPYLRQGLTPQSARSVCE 
1 1 KRHTNFDAVGLTDRSNVLAH I GVGHDHHI AGQPVKTDL SKSVI FDGEPRIAQDKAAI S 
CPDHNCQLNSAI WPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQ I FSGQLAMGIT 
EEQNKLASMAE I KALQAQINPHFFFNAINTI SAL I RI DSDKARYALMQLSTFFRTSLQGG 
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVF^NAVRHAF 
KERKTDNHI LVQI KPDGHYYCVSVSDNGQG I SDTI I DKLGQETVAESKGTGTALVNLNNR 
LNLLYGSVSCLHFSSDKNGTKVWYRI PNRI REDEHENFNS 

PRETTY of : /biotmp/msa442834 . 2 { * } January 13, 2003 06:47 

1 50 

msa442834. 2(248 090} LMVLLFQRLG IIMILAFLLV NNS YFRQL IE ERS KRETWL VIIFGLFVII 

msa442834.2{248 1169NT} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248 18RS21} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248 2603} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248~A909} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248 CJB110} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248_H36B} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248 JM9130013} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

tnsa442834.2{248 COHl} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248~M78l} LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

msa442834.2{248 M732) LMVLLFQRLG IIMILAFLLV NNSYFRQLIE ERS KRETWL VIIFGLFVII 

Consensus ********** ********** ********** ********** ********** 

51 100 
msa442834.2{248_090} SNITGIEIKG DRSLVERPFL TTISHSDSLA NTRTLV I TTA SLVGGPLVGS 
msa442834.2{248__1169NT} SNITGIEIKG DRSLVERPFL TTISHSDSLA NTRTLVI TTA SLVGGPLVGS 



915 
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Table 58: Comparative Sequences relating to SAG0182 



msa442834.2{ 

msa442834 . 

msa442834 . 
msa442834 .2{ 

msa442834 
msa442834.2{248 

msa442834 . 

msa442834. 

msa442834 . 



248_18RS2l} 
2{248_2603} 
2{248_A909} 
24 8_CJB110} 
2{248_H36B) 
JM9130013} 
2{248_COHl} 
2{248_M78l} 
2{248_M732} 
Consensus 



SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 



msa442834.2{248_090} 
msa442834 .2{248_1169NT} 
msa442834 . 2 {248_18RS21 } 
msa442834. 2 f 248^2603} 
msa442834 .2{248_A909} 
msa442834 .2 {248_CJB110} 
msa442834 .2{248_H36B} 
msa442834 .2{248_JM9130013} 
msa442834 . 2 { 248_C0H1 } 
msa442834.2{248_M78l} 
msa442834 . 2 { 248_M732 } 
Consensus 



msa442834 .2{248_090} 
msa442834 . 2 ( 248_1169NT> 
msa442834.2{248__18RS21} 
msa442834 .2f 248_2603} 
msa442834.2{248_A909} 
msa442834 ,2{248_CJB110} 
msa442834.2{248_H36B} 
msa442834 .2{248_JM9130013} 
msa442834 . 2 { 248_C0H1 } 
msa442834 . 2 (248_M781 \ 
msa442834.2{248_M732} 
Consensus 



msa442834 . 2 {248_090 } 
msa442 834 . 2 { 248JL169NT} 
msa442 834.2{248_18RS2l} 
msa442834 . 2 { 248_2603 } 
msa442834.2{248_A909} 
msa442 8 34 . 2 { 24 8_C JB11 0 } 
msa442834 . 2 {248_H36B} 
msa442834.2{248_JM9130013} 
msa442834.2{248_COHl} 
msa442834 .2{248_M781} 
msa442834.2{248_M732} 
Consensus 



DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 



TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 



NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 



SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 



********** ********** ********** ********** ********** 



101 

IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
********** 

151 

VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
********** 

201 

LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 



FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
FFQGSFSGSF 
********** 



YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
********** 



VSGKI GDKLK 
VSGKIGDKLK 
VSGKI GDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 
VSGKIGDKLK 



150 

ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 



********** ********** 



IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
********** 



GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
********** 



PMM I LNSLGS 
PMMILNSLGS 
PMMI LNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
********** 



QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 



QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 



TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 



********** ********** ********** ********** 



msa442834 . 2 {248_090 } 
msa442 834 . 2 { 248_1169NT } 
msa442834.2{248__18RS2l} 
rasa442834 . 2 { 248_2603 } 
msa442834.2{248_A909} 
msa4 42 8 34 . 2 { 24 8_C JB 1 1 0 } 
msa442834 -2{248_H36B} 
msa442B34 . 2 {248_JM9130 013 } 
msa442834.2(248_COHl} 
tnsa442834.2{248_M78l} 
msa442834.2{248_M732} 
Consensus 



msa442834.2{248_090} 
msa442834.2(248_1169NT} 
msa442834 . 2 { 248_18RS21 } 
msa442834.2{248__2603} 
msa4 42 8 34 . 2 { 24 8_A9 0 9 } 
msa442 834 . 2 { 248_CJB110 } 
msa442834.2{248_H36B} 
msa442834.2{248_JM9130013} 
msa4 4 2 8 3 4 . 2 { 2 4 8_COHl } 
msa442834 .2{248_M781} 
m9a442834.2{248_M732} 
Consensus 



msa442834.2{248_090} 



251 

VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
********** 

301 

CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 



AH I GvGHDHH 
AHI GvGHDHH 
AH I GvGHDHH 
AHI GvGHDHH 
AH I GvGHDHH 
AHI GvGHDHH 
AHI GvGHDHH 
AHI GvGHDHH 
AHI GvGHDHH 
AHI GvGHDHH 
AHIGiGHDHH 



IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 



SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 



200 

TLFLAILKTY 
TLFLAI LKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
TLFLAILKTY 
********** 

250 

I I KRHTNFDA 
I I KRHTNFDA 
I I KRHTNFDA 
I I KRHTNFDA 
1 1 KRHTNFDA 
I I KRHTNFDA 
1 1 KRHTNFDA 
I I KRHTNFDA 
I I KRHTNFDA 
I I KRHTNFDA 
I I KRHTNFDA 
********** 

300 

RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 



****_***** ********** ********** ********** 



AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 
AIWPLKIND 



KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVcALKMYF 
KTVcALKMYF 
KTVcALKMYF 



AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 



350 

ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 



********** ********** ***-****** ********** ********** 

351 400 
FSGQLAMGIT EEQNKLASMA EIKALQAQIN PHFFFNAINT ISALIRIDSD 



916 



WO 2004/018646 



PCT/US2003/026827 



Table 58: Comparative Sequences relating to SAG0182 



msa442834 .2{ 
msa442834.2{ 

msa442834 . 

msa442834 . 
msa442834 .2{ 

msa442834 . 
msa442834.2{248 

msa442834 . 

msa442834 . 

msa442834 



248_1169NT} 
248_18RS2l} 
2{248_2603} 
2{248_A909} 
248_CJB110} 
2{248_H36B} 
_JM9130013} 
2{248_COHl} 
2{24 8_M78l} 
2{248_M732} 
Consensus 



FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 



EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 



EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 



********** ********** ********** 



PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
********** 



ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
ISALIRIDSD 
********** 



msa442B34 .2{248_090} 
msa442834 . 2 { 248_1169NT} 
msa442834.2{248_18RS2l} 
msa442834 .2{248_2603} 
msa442834.2{248_A909} 
msa442834 . 2 { 248_CJB110 } 
' msa442834.2{248_H36B} 
msa442834 . 2 { 248_JM9130013 } 
msa442834 . 2 { 248_COHl } 
msa442834.2{248_M78l} 
msa442834 . 2 (248_M732 } 
Consensus 



401 

KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 



TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 



QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 



********** ********** ********** 



KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
********** 



450 

EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
********** 



msa442834 
msa442834.2{ 
msa442834.2{ 

msa442834. 

msa442834 . 
msa442834.2{ 

msa442834 
msa442834.2{248 

msa442834 

msa442834 . 

msa442834 . 



2{248_090) 
248_1169NT) 
248_18RS2l] 
2{248_2603; 
2{248_A909 
248_CJB110' 
2{248__H36B] 
_JM9130013} 
2l248_COHl) 
2{248_M781} 
2{248_M732} 
Consensus 



451 

LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 



MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 



**********.. ********** 



LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
LVENAVRHAF 
********** 



KERKTDNHI L 
KERKTDNHIL 
KERKTDNHI L 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 
KERKTDNHIL 



500 

VQ I KPDGHYY 
VQ I KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 
VQI KPDGHYY 



********** ********** 



msa442 834. 2 {248_090 
msa442834 . 2 { 248_1169NT 
msa442834.2{248_18RS21 
msa442834.2{248_2603 
msa442834.2{248_A909, 
msa442834 . 2 { 248_CJB110 J 
tnsa442834.2{248_H36B} 
msa442834.2{248_JM9130013} 
msa442834 . 2 { 248_COHl} 
msa442834 . 2 { 248_M781 } 
msa442834.2{248_M732} 
Consensus 



501 

CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 
CVSVSDNGQG 



ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 
ISDTIIDKLG 



QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 



********** ********** ********** 



GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
********** 



550 

LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
********** 



551 

msa442834 . 2 {248__090 } LHFSSDKNGT 

msa442834.2{248_1169NT} LHFSSDKNGT 

msa442834.2{248_18RS2l} LHFSSDKNGT 

msa442834. 2 {248^2603} LHFSSDKNGT 

msa442834.2(248_A909} LHFSSDKNGT 

msa442834.2{248_CJB110} LHFSSDKNGT 

msa442834.2{248_H36B} LHFSSDKNGT 

msa442834.2{248_JM9130013} LHFSSDKNGT 

msa4 42834.2(24 8_COHl } LHFSSDKNGT 

msa442834.2{248_M78l} LHFSSDKNGT 

msa442834.2{24 8JM7 3 2 } LHFSSDKNGT 

Consensus ********** 



KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 
KVWYRIPNRI 



580 

REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 
REDEHENFNS 



********** ********** 
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SEQ ID NO. 5901 
STRAIN 26.03 

ATGAATAAAAGAAG AAAATT ATCAAAATTG AATGTAAAAAAACAT CATTT AG CTT ATGG A 
G CTAT CACTTTAGTAG C C CT TTTTT CATGT ATTTTGG CTGT AATGGT CATCTTTAAAA.GT 
T CACAAGTTACTACTG AAT CTTTGT CAAAAG CAGAT AAAGTTCG C GTAG C CAAAAAAT CA 
AAAAT G ACTAAGG CG ACATCT AAATCAAAAGTAGAAGATGTAAAACAG GCT CCAAAAC CT 
T CT CAGG CAT CTAATGAAG C CC CAAAAT CAAGTT CT CAAT CTACAGAAGCTAATT CTCAG 
CAACAAGTT ACTG CG AGTGAAGAGG CAG CTGTAGAACAAG CAGTTGTAACAGAAAACACC 
CCTGCTACCAGTCAGGCACAACAAGCTTATGCrFGTTAC^ 

C^CACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATAcTGCAGGGGCTATTGGCrCA 
GCAGCTGGAGCAGAAATGGCTGCIX3CA 

ATTGCCCGTGAATCAA^TGGTAATCCTAATGTTGCTAATGCCrCAGGAGCTTCAGGACTT 

TTCCAAACGATGCCAGGTTGGGGTTGAACAGCTACAGTT 

ATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTACTAG 

SEQ ID NO. 5902 
STRAIN JM9130013 

AAAAGTT CACAAGTT ACT ACTGAAT CTTTGT CAAA 

AGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGAATAAGGCAA.CAT 
CTAAATCAAAAGT AG AAGGTGTAAAACACX3 CTC CAAAACCAAGTT CT CAA 
T CTACAGAAG CTAATT CT CAGCAACAAGTTACTGCGAGTG AAGAGGCAG C 
TGTAGAACAAG CAGTTGT AACAGAAAATAC CCCTG CTACCAGT CAAGCAC 
AACAAG CTTATGCTGTTACTGAGACAACTTATAGACCTGCT CAACACCAG 
C CGAGTGG CCAAGTATTG AG CAATGGAAATACTG CAGGGGTTATTGGCTC 
AGCAGCAGCAGCACAAATGGCTGCTGCAACGGGAGTTCCTCAG 
GGGAACAT ATTATTG C CCGTG AATCAAATGGTAAT CCTAACGTTG CTAAT 
GCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTC^ 
AGCTACAGTT CAGGAT CAAGTTAAT t CAG CT ATTAAAG CTTAT CGTG CTC 
AAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 5903 

STRAIN 1 169NT reverse complement 

AAAAGTTCACAAGTTACT ACTGAAT CTTTGT CAAAAGCAGATAAAGTTCG CGTAGC C 
AAAAAATCAAAAATGACTAAGGCGACAT CTAAATCAAAAGTAGAAGATGTAAAACAGG CT 
CCAAAACCTTCTCAGGCATCTAATGAAGTCCCAAAATCAAGTTCT 

AATTCTCAGCAACAAGTT ACT G CGAGTGAAGAGGCGGCTGTAGAACAAG CAGTTGTAACA 

GAAAATACCC CTG CTAC CAGT CAGG CACAACAAACTTATGCTGTTACTGAG ACAACTTAC 

AAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAAT 

GTCGGATCTG CTG CTG CAGCACAAATGG CTG CTG CAACAGGAGTCCCT 

GAACAT ATTATTG C C CGTGAAT CAAATGGTAAT CCTAATGTTG CTAATG CCT CAGGAG CT 

TCAGGACTTTTCCAA^CGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAA 

AATTCAG CTATTAAAG CTTAT CGTGCTCAAGGTTTATCAG CTTGGGGTTAC 

SEQ ID NO. 5904 

STRAIN 18RS21 reverse complement 

AAAAGTT CACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTC 
GCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAA. 
AACAGG CT CCAAAAC CTTCT CAGG CATCTAATGAAGCCCCAAAAT CAAGTT CTCAATCTA 
CAGAAG CT AATTCTCAG CAACAAGTTACTG CGAGTGAAGAGG CAG CTGTAGAACAAG CAG 
TTGTAACAGAAAACACCCCTGCTACCAGTCAGGCACAA 

CAACTTATAG ACCTG CT CAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATACTG 
CAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCT 

CTACITGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGOT 
CAGGAG CTTCAGGACTTTTC CAAACGATGCCAGGTTGGGGTTCA^ CAGG 
ATCAAGTT AATTCAG CTATT AAAGCTTATCGTG CTCAAGGTTTAT CAG CTTGGGGTTAC 

SEQ ID NO. 5905 

STRAIN 090 reverse complement 

TAG C CAAAAAAT CAAAAATG ATTAAGG CGACAT CTAAATCAAAAGTAGAAGATGTAAAAC 
AGG CT C CAAAACCTT CTCAGGCAT CTAATGAAG CC CCAAAAT CAAGTT CT CAATCTACAG 
AAG CTAATT CT CAG CAACAAGTT ACTGCGAGTGAAGAGGCAG CTGTAGAACAAG CAGTTG 
TAACAGAAAACAC CC CTG CTAC CAGT CAGG CAGAA.CAAG CTTAT G CTGTTACTGAGACAA 
CTTATAGAC CTGCTCAACACCAGACGAGTGG C CAAGTATTGAGTAATGGAAATACTG CAG 
GGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCrGCAACAGGAGTCCCT 
CTTGGGAACATATTATTGCC CGTGAAT CAAATGGTAAT C CT AATGTTG CTAATG C CTCAG 
GAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAA 

SEQ ID NO. 5906 

STRAIN A909 reverse complement 

AAGGCGACAT CTAAATCAAAAGTAGAAGATGTAAAACAGG CT CCAAAACCTT CTCAGG CA 
TCTAATGAAG CC CCAAAAT CAAGTT CTCAATCTACAGAAG CTAATT CT CAG CAACAAGTT 
ACTGCGAGTGAAG AGG CAGCT GT AGAACAAG CAGTTGTAACAGAAAACACCCCTG CTACC 
AGT CAGGCACAACAAG CTTATG CTGTTACTGAGACAACTTATAGACCTGCT CAACACCAG 
ACAAGTGG C CAAGTATTGAGT AATGGAAATACTGCAGGGG CTATTGG CT CAG CAG CTG CA 
G CACAAATGG CTG CTG CAACAGG AGTCC CT CAGTCT ACTTGGGAACATATTATTG CC CGT 
GAATCAAATGGTAAT C CTAATGTTGCTAATG CCTCAGGAG CTTCAGGACTTT^ CAAACG 
ATGCCAGGTTGGGGTTCAACAG CTACAGTT CAGAAT CAAGTTAATT CAG CTATTAAAG CT 
TATCGTGCTCAAGGTTTATCA 

SEQ ID NO. 5907 

STRAIN CJB1 10 reverse complement 

AAT CTTTGTCAAAAG CAG AT AAAGTT CG CGTAG CCAAAAAATGAAAAATG ACTAAGG C GA 
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CAT CT AAAT CAAAAGT AGAAG AT GTAAAACAGG CT C CAAAAC CTT CTCAGG CAT CT AATG 

AAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTA^TTCTCAGCAACAAGTTA 

GTG AAG AGG CAG CTGTAG AACAAG CAGTTGT AACAG AAAACAC CCCTG CT ACCAGT CAGG 

CACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAAC^ 

G C CAAGT ATTG AGTAATGGAAATACTGCAGGGG CT ATTGG CT CAG CAG CTG CAG CACAAA 

TGGCTGCTGCAACAGGAGTCCCrCAGTCTACTTGGGAACATATTAT^ 

ATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACrrTTTCCAAACGATGCCAG 

GTTGGGGTT CAACAG CTACAGTT CAGGAT CAAGTTAATTCAG CTATTAAAG CTTAT CGTG 

CTCAAGGTTTATCAG CTTGGGGTTAC 

SEQ ID NO. 5908 

STRAIN COH1 reverse complement 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAA 

AGTT CG CGTAG CCAAAAAAT CAAAAATGACTAAGGCGACAT CT AAAT CAAAAGTAG AAGA 
TGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCC 

AT CTACAGAAGCTAATTCT CAG CAACAAGTT ACTG CGAGTG AAGAGGCGG CTGTAGAACA 
AGCAGTTGTAACAGAAAATACCCCTGCTACCAGTCAGGCACAACAAA 
TGAGACAACTTACAAACCTGCTCAACACCAGACAAGTC 
TACTGCAGGGGCGGTCGGATCTGCTGCTGCAGCACAAATGGCT 

T CAGT CTACTT GGGAACATATT ATTG C C CGTGAAT CAAATGGTAAT CCTAATGTTG CT AA 

TGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAAC^ 

T CAGGATCAAGTTAATTCAG CTATTAAAG CTTATCGTG CT CAAGGTTTATCAG CTTGGGG 

TTAC 

SEQ XD NO. 5909 

STRAIN H36B reverse complement 

AAAAGTT CACAAGTT ACTACTGAATCTTTGT CAAAAGC 

AGATAAAGTT CG CGTAGC CAAAAAAT CAAAAATGACTAAGG CGACATCTAAAT CAAAAGT 
AGAAGATGTAAAACAGGCTC CAAAACCTTCT CAGGCAT CTAATGAAGCCC CAAAAT CAAG 
TTCTCAAT CT ACAGAAG CTAATT CT CAG CAACAAGTTACT GCG AGTGAAG AGG CAG CTGT 
AGAACAAG CAGTTGTAACAGAAAACACCCCTG CTAC CAGT CAGGCACAACAAG CTTATGC 
TGTTACTGAGACAACTTATAGAC CTG CTCAACAC CAGACAAGTGG CCAAGTATTGAGT AA 
TGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTC 
AGT CC CTCAGT CT ACTTGGG AACATATTATTGC CCGTGAAT CAAATGGTAAT CCTAATGT 
TG CTAATG C CTCAGGAG CTT CAGGACTTTT C CAAACGATG C CAGGTTGGGGTT CAACAGC 
TACAGTT CAGGAT CAAGTTAATT CAG CTATTAAAG CTT 

SEQ ID NO. 5910 

STRAIN M732 reverse complement 
AAAAGTT CACAAGTTACTACTGAATCITTGTC 

CAAAAAAT CAAA^TGACTAAGG CGACATCTAAAT CAAAAGTAGAAGATGTAAAACAGGC 

T CCAAAACCTT CT CAGGCAT CTAATGAAG C C CCAAAAT CAAGTTCTCAATCTACAGAAGC 

TAATT CTCAG CAACAAGTTACTG (2GAGTGAAGAGGCGG CTGTAGAACAAG CAGTTGTAAC 

AGAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACT 

CAAACCTG CT CAACAC CAGACAAGTGG CCAAGTATTGAGCAATGGAAATACTG CAGGGGC 

GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCT 

GGAACATATTATTGC CCGTGAATCAAATC4GTAATCCTAATGTTGCTAATG C CT CAGGAG C 

TTCAGG ACTTTTC CAAACGATG C CAG GTTGGGGTT CAACAG CTACAGTT CAGGAT CAAGT 

TAATTCAGCTATTAAAGCTTATCGTGCTCAA 

SEQ ID NO. 5911 

STRAIN M781 reverse complement 

T CTTTGTCAAAAG CAGATAAAGTTCG CGTAG CCAAAAAAT CAAAAATG ACT AAGG CGACA 
T CTAAATCAAAAGTAGAAGATGTAAAACAG G CT CCAAAAC CTT CT CAGG CAT CTAATGAA 
G C C CCAAAAT CAAGTT CT CAATCTACAGAAG CTAATT CTCAGCAACAAGTTACTG CGAGT 
GAAGAGGCGGCTGTAGAACAAG CAGTTGTAACAGAAAATACC CCTG CTAC CAGT CAGG CA 
CAACAAACTTATG CTGTTACTGAGACAACTTACAAACCTG ^ CAGACAAGTGGC 
CAAGTATTGAG CAATGGAAATACTGCAGGGGCGGT CGGAT CTG CTG CTGCAGCACAAATG 
G CTG CTG CAACAGGAGTC CCTCAGT CTACTTGG GAACATATTATT G CCCGTGAATCAAAT 
GGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGT 
TGGGGTTCAACAG CTACAGTT CAGGATCAAGTT AATT CAG CTATT AAAGCTT AT CGTG CT 
CAAGGTTT AT CAG CTTGGGGTTAC 

PRETTY of : /biOtmp/msa519780 . 2 { *} March 10, 2003 06:25 

1 50 

msa519780.2{25_COHl} 

msa519780.2{25_M78l} 

msa519780.2{25_M732} 

msa519780.2f25_1169NTj 

msa519780 . 2{25_18RS21} 

msa519780 -2{25_A909) 

msa519780.2{25_090} 

mBa519780.2{25_CJB110) 

rnsa519780 .2(2603) atgaataaaa gaagaaaatt atcaaaattg aatgtaaaaa aacatcattt 



msa519780.2{25_H36B} 
msa519780.2{25_JM9130013} 
Consensus 



msa519780.2{25_COHl 
msa519780.2{25_M781 
msa519780.2{25_M732 



********** ********** ********** ********** ********** 
51 100 
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rasa519780.2f 25_1169NT} 

msa519780 .2{25_18RS2l} — 

msa519780 . 2 {25_A909} 

msa519780 .2{25_090} 

msa519780 .2{25_CJB110} ~~ 

msa519780 .2(2603} agcttatgga gctatcactt tagtagccct tttttcatgt attttggctg 

msa519780 . 2{25_H36B} 

msa519780 . 2{25 JM9130013 } — [ 

~ consensus ********** ********** ********** ********** ********** 

101 I 50 

msa51978 0.2{25_COHl} aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa519780.2(25_M78l} tc tttgtcaaaa 

msa5l9780.2(25_M732} aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa519780.2(25_1169NT} aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa519780.2(25_18RS2lj aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa51978 0.2(25_A909} 

msa519780 .2{25_090} -r ~ 

msa5197 8 0 . 2 { 25_CJB110 } aatc tttgtcaaaa 

msa519780 .2(2603} taatggtcat ctttaaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa519780.2(25_H36B} — aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

msa519780.2(25_JM9130013} aaaagt tcacaagtta ctactgaatc tttgtcaaaa 

Consensus ********** **** 

151 200 

msa519780.2(25__COHl} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780.2(25_M78l} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780.2(25_M732} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780.2{25_1169NT} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780.2{25 18RS21} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780.2{25_A909} A AGGCgACATC 

msa519780.2{25_090} tagc caaaaaatca aaaatgattA AGGCgACATC 

msa519780.2(25_CJB110} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780 .2(2603} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

msa519780 .2{25_H36B} gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC 

rasa519780.2(25 JM9130013} gcagataaag ttcgcgtagc caaaaaatca aaaatgaatA AGGCaACATC 

1 — „^.—„,,l * ****_***** 

Consensus 



msa519780.2{25_COHl] 
msa519780.2(25_M78l] 
msa519780.2(25_M732] 
msa51 97 8 0 . 2 ( 2 5_1169NT 
msa519780 . 2 (25_18RS21] 
msa519780 . 2 {25_A909} 
msa519780.2(25__090} 
msa519780 . 2{25_CJB110 J 
msa519780. 2(2603} 
msa519780 . 2 {25_H36B} 
msa519780.2{25_JM9130013} 
Consensus 



201- 

TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 
TAAATCAAAA 



GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGaTG 
GTAGAAGgTG 



TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC 
TAAAACAGGC' 
TAAAACAGGC 



********** *******-** ********** 



TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAACct 
TCCAAAAC . . 
********__ 



250 

tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 
tctcaggcat 



msa5 1 9 7 8 0 . 2 { 2 5_COHl } 
msa519780.2(25_M78l] 
msa519780 . 2 (25_M732 ] 
msa519780.2(25_1169NT} 
msa519780.2(25_18RS2l] 
rasa519780.2(25_A909 
msa519780.2{25_090] 
msa519780.2(25_CJB110} 
tnsa519780. 2(2603} 
msa51978 0 . 2 (25_H36B> 
msa519780.2{25_JM9130013} 
Consensus 



251 

ctaatgaagc 
ctaatgaagc 
ctaatgaagc 
ctaatgaagt 
ctaatgaagc 
ctaatgaagc 
ctaatgaagc 
ctaatgaagc 
ctaatgaagc 
ctaatgaagc 



cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
cccaaaatCA 
CA 



AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
AGTTCTCAAT 
********** 



CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
CTACAGAAGC 
********** 



300 

TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
TAATTCTCAG 
********** 



301 

msa519780.2(25_COHl} CAACAAGTTA CTGCGAGTGA AGAGGCgGCT 

.msa519780.2(25_M78l} CAACAAGTTA CTGCGAGTGA AGAGGCgGCT 

msa519780.2(25_M732) CAACAAGTTA CTGCGAGTGA AGAGGCgGCT 

msa519780.2(25_1169NT} CAACAAGTTA CTGCGAGTGA AGAGGCgGCT 

msa519780.2(25_18RS2l} CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

msa519780.2{25_A909} CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

Tnsa519780.2(25_090j CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

msa519780.2(25_CJB110} CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

msa5 19780. 2(2603} CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

msa519780.2(25__H36B) CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

msa51978 0.2(25_JM9130013) CAACAAGTTA CTGCGAGTGA AGAGGCaGCT 

Consensus ********** ********** ******_*** 



GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 
GTAGAACAAG 



350 

CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 
CAGTTGTAAC 



********** ********** 



351 400 
msa519780.2{25 COHl} AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG 
msa519780.2(25""M78l} AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG 



920 
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msa519780.2{25_M732} 
msa519780 ,2{25_1169NT} 
msa519780 . 2 { 25_18RS2l} 
msa519780 . 2{25_A909} 
msa519780 .2{25_090> 
msa519780 . 2 (25_CJB110) 
msa519780. 2(2603} 
msa519780 . 2 {25_H36B} 
msa519780.2{25_JM9130013} 
Consensus 



msa519780.2(25_COHl} 
msa519780.2(25_M78l} 
msa519780 . 2 (25„M732 } 
msa519780.2(25_1169NT} 
msa519780,2(25_18RS2l} 
msa519780.2{25_A909} 
msa519780 .2{25_090} 
msa519780.2{25_CJB110} 
msa519780. 2(2603} 
msaS19780.2{25_H36B} 
msa519780.2{25_JM9130013} 
Consensus 



msa51978 0 . 2 { 25_COHl } 
msa519780.2{25_M78i; 
msa519780.2(25_M732 < 
ms a5 1 97 8 0 . 2 { 2 5_11 6 9NT ; 
msa519780.2(25_18RS2ll 
msa51 97 8 0 . 2 { 2 5_A909 j 
msa519780.2{25__090 
msa51978 0 . 2 { 25_CJB110 
msa519780. 2(2603, 
msa519780 . 2 (25_H36B j 
msa519780.2{25_JM9130013) 
Consensus 



msa51978 0 . 2 { 25_C0H1 } 
msa519780.2{25J478l} 
msa519780.2{25_M732| 
msa519780 . 2 { 25_1169NT} 
tnsa519780.2{25_18RS2l} 
msa519780.2{25_A909} 
msa5197 80 . 2 { 25_090 } 
msa519780.2{25_CJB110} 
msa519780. 2(2603} 
msa519780.2(25_H36B} 
msa519780.2(25_JM9130013} 
Consensus 



AGAAAAtACC 
AGAAAAtACC 
AGAAAACACC 
AGAAAAcACC 
AGAAAAcACC 
AGAAAAcACC 
AGAAAACACC 
AGAAAAcACC 
AGAAAAtACC 
******_*** 

401 

AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
AGACAACTTA 
********** 

451 

AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
* ********* 

501 

TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACa 
TGCTGCAACg 



CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 
CCTGCTACCA 



GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAgGCACA 
GTCAaGCACA 



ACAAaCTTAT 
ACAAaCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 
ACAAgCTTAT 



GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 
GCTGTTACTG 



********** ****_***** ****-***** ********** 



cAaACCTGCT 
cAaACCTGCT 
cAaACCTGCT 
cAaACCTGCT 
tAgACCTGCT 
tAgACCTGCT 
tAgACCTGCT 
tAgACCTGCT 
tAgACCTGCT 
tAgACCTGCT 
tAgACCTGCT 



CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGa 
CAACACCAGc 



CaAGTGGCCA 
CaAGTGGCCA 
CaAGTGGCCA 
CaAGTGGCCA 
CgAGTGGCCA 
CaAGTGGCCA 
CgAGTGGCCA 
CgAGTGGCCA 
CgAGTGGCCA 
CaAGTGGCCA 
CgAGTGGCCA 



_*_******* *********- *_******** 



CTG CAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGc 
CTGCAGGGGC 
CTGCAGGGGc 
CTGCAGGGGt 
*********_ 



ggTcGGaTCt 
ggTcGGaTCt 
ggTcGGaTCt 
ggTcGGaTCt 
taTtGGcTCa 
taTtGGcTCa 
taTtGGcTCa 
taTtGGcTCa 
taTtGGcTCa 
taTtGGcTCa 
taTtGGcTCa 



GCtGCtGCAG 
GCtGCtGCAG 
GCtGCtGCAG 
GCtGCtGCAG 
GCaGCtGCAG 
GCaGCtGCAG 
GCaGCtGCAG 
GCaGCtGCAG 
GCaGCtGCAG 
GCaGCtGCAG 
GCaGCaGCAG 



' 450 
AGTATTGAGc 
AGTATTGAGc 
AGTATTGAGc 
AGTATTGAGc 
AGTATTGAGt 
AGTATTGAG t 
AGTATTGAGt 
AGTATTGAGt 
AGTATTGAGt 
AGTATTGAGt 
AGTATTGAGc 
*********_ 

500 

CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 
CACAAATGGC 



_.„*_**_**_ **_**-**** ********** 



GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGTcCCTC 
GGAGT t CCTC 



AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 
AGTCTACTTG 



GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 
GGAACATATT 



550 

ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 
ATTGCCCGTG 



*********_ *****-**** 



********** ********** ********** 



msa519780.2(25_COHl} 
msa519780.2(25_M78i; 
msa519780 . 2 { 25_M732 ; 
msa519780.2(25_1169NT] 
msa519780.2(25_18RS2l' 
msa519780.2(25_A909; 
msaS 19780.2(2 5J390 } 
msa519780.2(25_CJB110} 
msa519780. 2(2603} 
msa519780 . 2 {25_H36B} 
msa519780.2(25_JM9130013} 
Consensus 



msa519780.2(25_COHll 
msa519780 . 2 ( 25_M781 
msa519780 . 2 (25J4732 
msa51978 0 . 2 { 25_1169NTi 
msa519780.2(25_18RS2l} 
msa519780 . 2 { 25_A909 
msa519780 . 2(25_090 
msa519780.2(25_CJB110 
11133519780.2(2603, 
msa519780.2(25_H36B) 
msa519780.2(25_JM9130013] 
Consensus 



551 

AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
AATCAAATGG 
********** 

601 

TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
TTCCAAACGA 
********** 

651 



TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAt 
TAATCCTAAc 
*********_ 



GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
GTTGCTAATG 
********** 



CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 
CCTCAGGAGC 



600 

TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 
TTCAGGACTT 



********** ********** 



TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
.TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 
TGCCAGGTTG 



GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 
GGGTTCAACA 



GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 
GCTACAGTTC 



********** ********** ********** **-*- 



650 

AGgAtcaagt 
AGgAtcaagt 
AGgAtcaagt 
AGgAtcaagt 
AGgAtcaagt 
AGaAtcaagt 

AGgA 

AGgAtcaagt 
AGgAtcaagt 
AGgAtcaagt 
AGgAtcaagt 



700 



msa519780.2(25_COHl} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 
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msa519780.2{25_M78l} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa519780.2{25_M732} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa51978 0.2{25_1169NT} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa5l9780.2{25_18RS2l} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa519780.2{25_A909} taattcagct attaaagctt atcgtgctca aggtttatca 

msa519780 .2{25_09Q} 

msa519780.2{25_CJB110} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa519780.2{2603} taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

msa519780.2{25_H3SBj taattcagct attaaagctt 

msa519780 . 2 {25_JM9130013 } taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt 

Consensus 



701 

msa519780.2{2 5_COHl} ac 

msa519780.2{25_M78l} ac 

msa519780.2{25_M732} a 

msa519780.2{25_1169NT} ac 

msa519780.2{25_18RS2l} ac 

msa519780.2{25_A909} 

msa519780.2{25_090) 

msa519780.2{25_CJB110} ac 

msa51978 0.2 {2603} actag 

msa519780 . 2 {25_H36B} 

msa519780.2{25_JM9130013} ac 

Consensus --*** 



SEQ ID NO. 5912 
STRAIN 2603 frame: 1 

MNKRRKLSKLNVKKHHLAYGAITLVALFSCI LAVMVI FKSSQVTTESLSKADKVRVAKKS 
KMTKATSKSKVEDVKQAPKPSQASNEAPKS S SQSTEANSQQQVTASEEAAVEQAWTENT 
PATSQAQQAYAVTETTYRPAQHCTSGQVLSNGNTAGAI GSAAAAQMAAATGVPQSTWEH I 
IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRAQGLSAWGY 

SEQ ID NO. 5913 
STRAIN 1169NT frame: 1 

KSSQVTTESLSKADKVRVAKKSK>rrKATSKSK^/EDVKQAPKPSQASNEVPKSSSQSTEAN 
SQQQVTASEFJVA.VEQAVVTENTPATSQA(^TYAVTETTYKPAQHQTSGQVLSNGNTAGAV 
GSAAAAQMAAATGVPQSTWEHI I ARE SNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
SAI KAYRAQGLSAWGY 

SEQ ID NO. 5914 
STRAIN 18RS21 frame: 1 

KS S QVTTE SL S KAD KVRVAKKS KMTKAT S KS KVED VKQAP KPS QASNEAP KS S S Q S TEAN 
S QQQVTAS EEAAVEQAWTENT PAT S QAQQAYAVTETT YR PAQHQTSGQVL SNGNTAGA I 
GSAAAAQMAAATGVPQSTWEHI I ARE SNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
SAI KAYRAQGLSAWGY 

SEQ ID NO. 5915 
STRAIN 2603 frame: 1 

KjSSQVTTESLSKADKVRVAKXSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
S QQQVTAS EEAAVEQAWTENT PAT S QAQQAYAVTETT YRPAQHQT SG QVL SNGNTAGA I 
GSAAAAQMAAATGVPQSTWEHI I ARE SNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
SAI KAYRAQGLSAWGY 

SEQ ID NO. 5916 

STRAIN 090 frame: 3 

AKKSKM I KATSKSKVEDVKQAPKPSQASNEAPKSSSQSTE1ANSQQQVTASEEAAVEQAVV 

TENTPATSCAQQAYAVTETTYRPAQHCTSGQVLSNGKT^^ 

WEHI I ARE SNGNPNVANASGASGLFQTMPGWGSTATVQ 

SEQ ID NO. 5917 

STRAIN A909 frame: 1 

KAT S KS KVED VKQAP KPS QASNEAP KS S SQSTEANS QQQVTAS E EAAVEQ AWTENT PAT 
S QAQQAYAVTETT YRPAQHQTSGQVLSNGNTAGA I GSAAAAQMAAATGVPQSTWEH 1 1 AR 
E SNGNPNVANASGASGLFQTMPGWGSTATVQNQVNS AI KAYRAQGLS 

SEQ ID NO. 5918 

STRAIN CJB 110 frame: 3 

SLSKADKWVAKKSKOTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQWAS 
E EAAVE QAVVT ENTPAT S QAQQAYAVTETT YRPAQHQT SGQVL SNGNTAGA I G S AAAA.QM 
AAATGVPQSTWEHI IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAI KAYRA 
QGLSAWGY 

SEQ ID NO. 5919 
STRAIN COH1 frame: 1 

KSSQV^ESLSKADKWVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
S QQQVTASEEAAVEQAWTENTPATS QACK2T YAVTETTYKPAQHQTSGQVLSNGNTAGAV 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
SAI KAYRAQGLSAWGY 

SEQ ID NO. 5920 
STRAIN H36B frame: 1 
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KSSQVTTESLSKADKVRVAKKSKMTIG^TSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI 
GSAAAAQMAAATGVPQSTWEH 1 1 ARE SNGNPNVANASGAS GLFQTMPGWG STATVQDQ VN 
SAIKA 



SEQ ID NO. 5921 
STRAIN M732 frame: 1 . 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQQVTAS EEAAVEQAVVTEOTPATS QAG^T YAWETT YKPAQHQT SGQVLSNGNTAGAV 
G S AAAAQMAAATG VPQSTWEH 1 1 ARE SNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
SAI KAYRAQGLSAWG 

SEQ ID NO. 5922 
STRAIN M781 frame: 4 

SliSKADKVRVAKKSKMTKATSKSKOTDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTAS 
EEAAVE QA WTENT PAT S QAQQTYAVTETT Y KPAQHQT S G Q VL SNGNTAGAVG SAAAAQM 
AAATGVPQSTWEHI I ARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAI KAYRA 
QGLSAWGY 

SEQ ID NO. 5923 
STRAIN JM9130013 frame: 1 

KS SQVTTESIj s KADKVRVAKKSKMNKATSKS KVEGVKQAPKPS SQSTEANSQQQVTAS EE 
AAVEQAVVTENTPATSQAC^AYAWETTYRPAQHQPSGQVLSNGNTAGVIGSAAAAQMAA 
ATGVPQSTWEHI IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAI kayraqg 
LSAWGY 

MSA Alignment Results: Pretty output 

PRETTY of : /biotmp/msa519418 . 2 { * } March 10, 2003 06:15 

1 50 

msa519418 . 2 { 25_090 } 

msa519418.2{25_H36B} KS SQVTTESLSK 

msa519418.2?25_COHl} KS SQVTTESLSK 

msa519418.2{25_M781) SLSK 

msa519418.2{25_1169NT} KS SQVTTESLSK 

msa519418.2{25_M732} KS SQVTTESLSK 

msa519418.2{25_18RS2l} KS SQVTTESLSK 

msa519418.2{25_CJB110} SLSK 

msa519418.2{25 2603} KS SQVTTESLSK 

msa519418 .2T2603} mnkrrklskl nvkkhhlayg aitlvalfsc ilavmvifKS SQVTTESLSK 

msa519418.2{25_A909} 

msa519418.2{25 JM9130013} KS SQVTTESLSK 

~~ consensus ********** ********** ********** ********** ********** 

51 100 

msa519418.2{25_090} akks kmiKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25_H36B} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25_COHl} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25_M78l} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25_1169NT} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasnevpks SSQSTEANSQ 

msa519418.2{25_M732} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25_18RS2l} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418 . 2{25_CJB110} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25 2603} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418. 2^2603} ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418 .2{25_A909> KATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ 

msa519418.2{25 JM9130013) ADKVRVakks kmnKATSKSK VEgVKQAPKP SSQSTEANSQ 

Consensus ******- ******* **_******* ********** 



msa519418.2{25_090} 
msa519418.2{25_H36B} 
msa5 1941 8 .2(2 5JZOH1 } 
msa519418.2{25_M78l} 
msa519418.2{25_1169NT} 
msa519418 .2{25_M732} 
msa519418 . 2 ( 25_18RS21 j 
msa519418 . 2{25_CJB110] 
msa519418.2{25 2603 
msa519418.2T2603; 
msa5194l8.2{25_A909] 
msa519418.2{25_JM9130013} 
Consensus 



msa519418 
msa519418 
msa519418 . 
msa519418 . 
msa519418 .2{ 

msa519418 
msa519418.2 
msa519418.2 



2{25_090j 
2{25_H36BJ 
2{25_COHl> 
2{25_M781J 
25_1169NT> 
2{25_M732J 
25_18RS21) 
25 CJB110} 



101 

QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 
QQVTASEEAA 



VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 
VEQAWTENT 



PATSQAQQaY 
PATSQAQQaY 
PATSQAQQtY 
PATSQAQQtY 
PATSQAQQtY 
PATSQAQQtY 
PATSQAQQaY 
PATSQAQQaY 
PATSQAQQaY 
PATSQAQQaY 
PATSQAQQaY 
PATSQAQQaY 



********** ********** ********_* 



151 

NGNTAGaiGS 
NGNTAGaiGS 
NGNTAGavGS 
NGNTAGavGS 
NGNTAGavGS 
NGNTAGavGS 
NGNTAGaiGS 
NGNTAGaiGS 



AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 
AAAAQMAAAT 



GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 
GVPQSTWEHI 



AVTETTYrPA 
AVTETTYr PA 
AVTETTYkPA 
AVTETTYkPA 
AVTETTYkPA 
AVTETTYkPA 
AVTETTYrPA 
AVTETTYrPA 
AVTETTYrPA 
AVTETTYrPA 
AVTETTYrPA 
AVTETTYrPA 
*******_** 



IARESNGNPN 
IARESNGNPN 
IARESNGNPN 
IARESNGNPN 
IARESNGNPN 
IARESNGNPN 
IARESNGNPN 
IARESNGNPN 



150 

QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQtSGQVLS 
QHQpSGQVLS 
***_****** 

200 

VANASGASGL 
VANASGASGL 
VANASGASGL 
VANASGASGL 
VANASGASGL 
VANASGASGL 
VANASGASGL 
VANASGASGL 
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msa519418 .2(25 2603} 
msa519418. 2^2603} 
msa519418 . 2 {25_A909> 
rasa519418 . 2 {25_JM9130013 } 
Consensus 



NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL 

NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL 

NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL 

NGNTAGviGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL 

******__** ********** ********** ********** ********** 



msa519418 ,2{25_090} 
msa519418 . 2 {25_H36B} 
msa519418 .2{25_COHl} 
msa519418 .2{25_M78l} 
msa519418.2{25_1169NT} 
msa519418 . 2 {25_M732} 
msa519418.2{25_18RS2l) 
msa519418.2{25_CMB110} 
msa519418.2{25 2603} 
msa519418.2T2603} 
msa519418.2{25_A909} 
msa519418 . 2 {25_JM9130013 } 
Consensus 



201 

FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 
FQTMPGWGST 



234 



ATVQ 

ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQDQVNSA 
ATVQnQVNSA 
ATVQDQVNSA 



IKA 

IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 
IKAYRAQGLS 



********** ********** ********** 



AWGY 
AWGY 
AWGY 
AWG- 
AWGY 
AWGY 
AWGY 
AWGY 

AWGY 
**** 
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SBQ ID NO. 6001 
STRAIN 2603 

ATGAAAGAAAAACAGTCGAAAAG^ 

ATAAGTGTTT TTACATACAGT ATT AG C CAG C CTT CT AAACTACTTCCACGAAAAG AATT A 
GTTATTCTAAGT CCAAATAGTCAAGCCATTTT AACAG GAACG ATT CCAG CTTTTG AGGAA 
AAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAACTAATAGATAGATTA 
AGTAAGGAGGGTAAG CAGTTGAAGGCGGATATTTTCTTTG GAGG AAATTATACG CAATTT 
GAAAGTCAT AAGGCATTGTTTGAGT CTT ACGTAT CAAAGAATGTT CATACTGTTATT C CA 
GACTATATCCATCCAAGTGATACGGCGACAC<^^ 

GTAAATAACGAATTAG CTAAGGGAC TTAC CAT CAAGAGTTATGAAGATTTATTACAGCCT 
TCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGAA 

CTCACTAATATACTCTTGGCCAAGGGTGGTTACACCAATCCAAAAGCGTGGAACTATGTT 
AAAAAG CTACAACATAATATT AATGCT AT CAAAT CTTCT AGCT CTT CAGAAGTTT AT CAA 
TCAGTTGCAGAAGGAAAAATGATTGTGGGGCTGACTTACGAAGA 

CAAAAAAGTGGTGC CAATGTTT CT ATT GT ATAT C CGACAGAAGGG ACAGTTTTTGTCC CA 
TCTT CGGTTGCAATTATAAAGAATG CT CCTTCTATGAAAGAAG CAAAGTTATTTATT AAT 
TTTATGCTTTCTTTAGATGTTCAAAATGCCT^ 

CGTAAAGATG CCCAAACGAGTAATGG CATGAAAG CTTTAAAGGATATTGCTACT CTTAAA 
GAAGATTAT CGCTATGT CACTAAG CATAAGGG C CAAAT C CTTAAAAC CTATAAT CGTATT 
CGTAGAAATG CTGAT 

SSQ ID NO. 6002 
STRAIN 090 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGTTATTCTAAGT 
CCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCT^ 
ATACGGTATAAAAGTTAAG CTT ATT CAAGGTGGG ACAGGG CAACTAATAG 
ATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATATTTTCTTTGGA 
GGAAATTATA03CAATTT\GAAAGTCATAAGG CAT^ 

AT CAAAGAAT GTT CATACTGTTATT CCAGACTATATCCATCCAAGTGATA 

CGGCC4ACACCTTATACTATAAATGGGAGTGTCTC 

TTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTATTACAGC 

CTTAAAAC^TAAAATTGCCTTTGCAGATCCGAATACTTCOT 

TCTCACAACTCA.CTAATATAOTCTTGGCCAAG 

AAAG CGTGGAACTATGTT AAAAAG CTACAACATAATATT AATGCT AT CAA 

ATCTT CTAG CTCTTCAGAAGTTTAT CAATCAGTTGCAGAAGGAAAAATGA 

TTGTGGGGCTGACTTACGAAGACCCTAGTGTCAATTTGC^^ 

GCCAATGTTTCTATTGTATATCCGACAGAAGGGACAGTTTTTG 

TT CGGTTGCAATTATAAAGAATGCTC CTTCTATGAAAGAAG CAAAGTTAT 

TTATTAATTTTATGCTT t CTTTAgATGTTCAAAATGC CTTTGGG CAGT CA 

ACGAGTAACCGACCTATT CGTAAAGATGC C CAAACGAGTAATGG CATGAA 

AG CTTTAAAGGATATTGCTACT CTTAAAG AAGATT ATCGCTATGT CACT A 

AG CATAAGGGCCAAATC CTTAAAACCTATAAT CGTATTCGTAGAAATG CT 

GAT 

SEQ ID NO. 6003 
STRAIN A909 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAG 

TTATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCT 
TTTGAGGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGG 
TCAACTAATAGATAGATTAAGTAAGGAGGGTAAG CAGTTGAAGG CGGATA 
TTTT CTTTGGAGGAAATTATACGCAATTTGAAAGTCATAAGG CATTGTTT 
GAGTCTTACGTATCAAAGAATATT CATACTGTTATTC CAGATTATATCCA 
TCCGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTG 
TAAATAACGAATTAGCTAAGGGACTT^ 

TT A(^G CCTTCCTT AAAAGGT AAAATT GCCTTTGCAGAT C CGAATACTT C 
CTCTAGTGCTTTCTCACAACTCACT 

ACAC CAATC CAAAAGCGTGGAACTATGTTAAAAAG CTACAACATAATATT 
AATGCTATCAAATCTTCTAGCTCTTC^GAAGT^ 

AGGAAAAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGC 
AAAAAAGTGGTG CCAATGTTTCTATTGTAT AT CCG ACAG AAGGGACAGTT 
TTTGTCC CATCTT CGGTTG CA^TTATAAAGAATG CTC CTTCT ATGAAAGA 
AG CAAAGTT ATTTATT AATTTT ATGCTTT CT7ITA CAAAATG C CT 

TTGGGCAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACGAGT 
ARTGGCATGAAAGCTTTAAAGGATATTGCTACT 

CTATGTCACTAAGCATAAGGG CCAAATCCTTAAAACCTATAAT CGTATTC 
GTAGAAATG CTGAT 

SEQ ID NO. 6004 

STRAIN H36B 

TAAACTACTT CCAC CAAAAGAATTAGTTATT CTAAGT CCAAATAGTCAAG 
CCATTTTAACAGGAACG ATT CCAG CTTTTGAGGAAAAATACGGT ATAAAA 
GTTAAGCTTATTCAAGGTGGGACAGGTCAACTAATAGATAGATTAAGTAA 
GGAGGGTAAG CAGTTGAAGG CGC4ATATTTT CTTTGGAGGAAATTATACG C 
AATTTGAAAGTCAT AAGGCATTGTTTG AGT CTT ACGTAT CAAAGAATATT 
CATACTGTTATTC CAGATTATATCCAT C CGAGTGATACGG CGACACCTTA 
TACTATAAATGGGAGTGTCTTGATTGTAAATAAcGAATTAGTTAAGGGAC 
TTAC CAT CAAGAGTTATGAAGATTTATTACAG CCTTCCTTAAAAGGTAAA 
ATTG CCTTTGCAGATCCGAATACTTCCTC TAGTG CTTT CT CACAACT CAC 
TAATATACT CTTGG C CAAGGGTGGTTACAC CAAT CCAAAAGCGTGGAACT 
ATGTTAAAAAG CTACAACAT AATATTAATG CTAT CAAAT CTTCTAG CTCT 
TCAGAAGTTT AT CAAT CAGTTGCAGAAGGAAAAATGATTGTGGGGTTGAC 
TTACGAAGACCCTAGTGTCAATTTGCAAAAAAGTGGTGCC^ 
TTGT ATAT CCGACAG AAGGG ACAGTTTTTGT C CCAT CTT CGGTTG CAATT 
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ATAAAGAATG CT C CTT CT ATGAAAG AAG CAAAGTTATTT ATT AATTTT AT 
GCTTTCTTTAGATGTTGAAAATGCCTTTGGGC^ 

CTATT CGTAAAGATG C C CAAACGAGTAATGG CATG AAAG CTTTAAAG GAT 
ATTG CTACT CTT AAAGAAG ATTAT C G CTATGTCACTAAG CAT AAG GG CCA 
AATC CTTAAAACCT AT AAT CGTATT CGTAG AAATG CTGAT 

SEQ ID NO. 6005 
STRAIN 18RS21 

CAGCCTT CT AAACT ACTT C CAC CAAAAGAATTAGT TATT CTAAGT CCAAA 
TAGTCAAGCCATTTTAACAGGAACGATTCCAG CTTTTGAGGAAAAATACG 
GTATAAAAGTTAAG CTT ATT CAAGGTGGGACAGGG CAACTAATAG ATAG A 
TTAAGTAAGGAGGGTAAGCAGTTGAAGGCGgATATTTTCTTTGGAGGAAA 
TTATACG CAATTTGAAAGT CAT AAGG CATTGTTTGAGT CTTACGTAT CAA 
AGAATGTTCATACTGTTATTCCAGACTATATCCATCCAAGTGATACGGCG 
ACACCTTATACTATAAATGGGAGTGTCTTGATTGTAAATAACGAATTAGC 
TAAGGGACTTACCAT CAAG AGTTATGAAGATTTATTACAG CCTT C CTTAA 
AAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTAGTG 
CAACT CACT AATATACT CTTGG CCAAGGGTGGTTACAC CAAT CCAAAAG C 
GTGGAACTATGTTAAAAAG CTACAACATAATATTAATG CTAT CAAAT CTT 
CTAG CTCTT CAGAAGTTTAT CAATCAGTTGCAGAAGGAAAAATG ATTGTG 
GGGCTGACTTACGAAGACCCT'AGTGTCAATTTGCAAAAA 
TGTTTCTATTGTATATCCGACAGAAGGGACAGTTTTTGT C CCAT CTT CGG 
TTGCAATTATAAAGAATGCT C CTTCTATGAAAGAAGCAAAGTTATTTATT 
AATTTTATG CTTTCTTTAGATGTTCAAAATGC CTTTGGG CAGT CAACGAG 
TAACCGACCTATTCGTAAAGATGCC CAAACGAGTAATGG CATGAAAG CTT 
TAAAGGATATTG CTACT CTTAAAGAAGATTAT CG CTATGTCACTAAGCAT 
AAGGG CCAAATC CTTAAAAC CT AT AAT CGTATT CGTAGAAATG CTGAT 

SEQ ID NO. 6006 
STRAIN M732 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGT 

TATTCTAAGTCCAAATAGT{^\GCCATTTTAACAGGAACGATTCCAGCTT 
TTGAGGAAAAATACGGTATAAAAGTTAAGCTTATT CAAGGTG GGACAGGG 
CAACTAATAGAT AGATT AAGTAAGG AGGGTAAG CAGTTG AAGG CG GATAT 
TTTCTTTGGAGGAAATTATACGCAATTTGAAAGTCA 

AGTCTTACGTAT CAAAGAATGTTCATACTGTT ATT CCAG ACTAT AT CCAT 
CCGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGT 
AAATAACGAATTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTAT 
TACAGCCTTCCTTAAAAGGTAAAATTGCCIUU'GCAGATCCGAATACTTCC 
TCTAGTGCTTTCTC^CAACTCACTAATATACT 

CACCAATCCAAAAGCGTGGAACTATGTTAAAAAGCTACAACATAATATTA 

ATGCTATCAAAT CTT CTAG CT CTTCAGAAGTTTAT CAAT CAGTTG CAGAA 

GGAAAAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGC^ 

AAAAAGTGGTGCCAATGTTTCTATTGTATACCCGAC^GAAGGGA 

TTGT CCCAT CTT CGGTTG CAATTATAAAGAATGCTCCTT CTATGAAAGAA 

GCAAAGTTATTTATTAAri T TATGCTT^ 

TGGG CAGTG?^CGAGTAACCGACCTATT CGTAAAGATG CC CAAACAAGTA 

ATG^SCATGAAAGCTTTAAAGGATATCG CTACT CTTAA^ 

TATGT CACTAAG CATAAGAG CCAAATC CTTAAAACCTATAAT CG CATTCG 

TAGAAATGCTGAT 

SEQ ID NO. 6007 
STRAIN COH1 

CAGCCIT CTAAACTACTT C CACCAAAAGAATTAGTT 
ATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGAACGAT^ 
TGAGGAAAAATACGGTATAAAAGTTAAGCTTATT CAAGGTGGGACAGGG C 
AACTAATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATATT 
TTCITTGG^GGAAATTATACG CAATTTGAAAGT CATAAGG CATTGTTTG A 
GTCTT'ACGTATCAAAGAATGTT CATACTGTTATT CCAGACTATAT CCAT C 
CGAGTGATACGG CGACACCTTATACTATAAATGGGAGTGT CTTGATTGT A 
AATAACGAATTAG CTAAGGGACTTACCAT CAAGAGTTATG AAGATTTATT 
ACAG C CTTCCTT AAAAGGTAAAATTG CCTTTG CAGAT CCGAATACTT CCT 
CTAGTGCTTTCTCACAACTGA.CTAATATACT 

AC CAATCCAAAAG CGTGGAACT ATGTTAAAAAGCTACAACATAAT ATTAA 
TG CTATCAAATCTT CTAGCTCTTGAGAAGTTTATCAAT CAGTTG CAG AAG 
GAAAAATGATTGTGGG^TTG ACTTACGAAG AC CCTAGTGT CAATTTG CAA 
AAAAGTGGTG CC!AATGTTTCTATTGTATAC CCGACAGAAGGGACAGTTTT 
TGTC CCATCTTCGGTTG CAATTATAAAGAATGCT CCTT CTATGAAAG AAG 
CAAAGTTATTTATTAATTTTATGCTTT(^^ CAAAATG CCTTT 

GGGCAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACAAGTAA 
TGG CATGAAAGCITTAAAGGATAT CGCTACT CTTAAAGAAGATTATCGCT 
ATGTCACTAAGCATAAGAGCCAAATCCTT'AAAACCrATAATCGCATTCGT 
AGAAATG CTGAT 

SEQ ID NO. 6008 
STRAIN M781 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGTTATT 

CTAAGTC CAAATAGTCAAG CCATTTTAACAGGAACGAT^ C CAGCTTTTG A 

GGAAAAATACGGTAT AAAAGTTAAG CTTATTCAAGGTGGGACAGGGCAAC 

TAATAGATAG ATTAAGTAAGGAGGGTAAG CAGTTGAAGG CGGATATTTT C 

TTTGGAGGAAATTATACG CAATTTGAAAGTCATAAGG CATTGTTTGAGT C 

TTACGTATOUyVGAATGTTCATACTGTTATTCCAGACTATATCCATCCGA 

GTGATACGG CGACAC CTTATACTATAAATGC<1AGTGT (CTTGATTGTAAAT 
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AA.CGAATTAG CT AAGGG ACTTACCAT CAAG AGTT ATG AAGATTTATTACA 
G C CTT CCTTAAAA.G GTAA^A>TTG C CTTTG CAG AT C CG AATACTT C CT CTA 
GTGCTTTCTCACAAOTCACTAATATACTCTTGGCCAAGGGTGGTTACACC 
AAT CCAAAAG CGTGG AACTATGTT AAAAAG CTACAACATAAT ATT AATG C 
TAT CAAAT CTTCTAGCT CTT CAGAAGTTT ATCAAT CAGTTG CAG AAGGAA 
AAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGCAAAAA 
AGTGGTGCCAATGTTTCTATTGTATACCCGACAGAAGGGA<^GTTTTTGT 
CC CAT CT TCGGTTG CAATTATAAAG AATG CT CCTT CT AT GAAAG AAGCAA 
AGTTATTTATTAATTTTATG CT TT CTTTAGATGTT CAAAATG C CTTTGGG 
CAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACAAGTAATGG 
CATGAAAG CTTTAAAGGATATCGCT ACTCTTAAAGAAGATTAT CG CT ATG 
TCACTAAGCATAAGAGCCAAATCCTTAAAACCTATAATCGCATTCGTAGA 
AATGCTGAT 

SEQ ID NO. 6009 
STRAIN CJB110 

CAG C CTTTTAAACTACTT C CACCAAAAGAATTAGTTATT CT 
AAGT C CAAATAGT CAAG C CATTTT AACAGGAACGATT C CAGCTTTTGAGg 
AAAAATACGGTATAAAAGTTAAG CTTATT CAAGGTGG GACAGGG CAACT A 
AT AG ATAGATTAAGT AAGGAGGGT AAG CAGTTGAAGG CGGATATTTT CTT 
TGGAGG^yUVTTATACGCAATTTGAAAGTCATAAGGCATTGTTTGAGTCT 
ACGTATCAAAGAATGTTCAT ACTGTTATT CCAGACTATAT CCAT C CAAGT 
GATACGG CGACACCTTATACTATAAATGGGAGTGT CITGATTGT AAATAA 
CG AATTAG CTAAGGGACTTACCAT CAAGAGTTATGAAGATTTATTACAG C 
CTTC CTTAAAAGGTAAAATTG C CTTTG CAGATC CGAATACTT CCT CTAGT 
G CTTT CT CACAACT CACTAATATACTCTTGG CCAAGGGTGGTTA CAA 
TCCAAAAG CGTGGAACT ATGTTAAAAAG CTACAACATAAT ATTAATG CTA 
TCAAATCTTCTAGCTCTTCAGAAGTTTATG^ 

ATGATTGTGGGG CTGACTTACGAAGAC CCT AGTGTCAATTTGCAAAAAAG 
TGGTGCCAATGTTT CTATTGTATAT CCGACAGAAGGGACAGTTTTTGT C C 
CATCTTCGGTTG CAATTAT AAAGAATG CT C CTTCTATGAAAG AAG CAAAG 
TTATTTATTAATTTTATGCTTTCTTTAGATGTT CAAAATGCCTTTGGGCA 
GT CAACGAGT AACCGAC CTATTCGTAAAGATG C CCAAACGAGTAATGGCA 
TGAAAGCTTTT AAAGG AT ATTGCTACT CTTAAAGAAGATTAT CG CTATGT C 
ACTAAGCAT AAGGG CCAAAT CCTTAAAAC CTATAATCGTATTCGTAG A^A 
TGCTGAT 

SEQ ID NO. 6010 
STRAIN 1169NT 

ATAGTCAAG CCATTTTAACAGG AACGATTC CAG CTTTTG AGG AAAAAT AC 
GGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAACTAATAGATAG 
ATT AAGT AAGGAGGGTAAG CATTTG AAGGCGGATATTTT CT t TGGAGGAA 
ATTATACGCAATTTGAAAGT CATAAGG CATTGTTT GAGTCTTACGTATCA 
AAGAATGTTCATACTGTTATTC G^GACTAT AT CCATC CAAGTGATACGG C 
GACACCTTATACTATAAATGGGAGTGTCTTGATTGTAAATAACGAATTAG 
CTAAGGGACTTACCATCAAGAGTTATGAAGATTTATT^ 
AAAGGTAAAATTGC CTTTGCA.GATC CGAATACTTC CTCT 
ACAACT CACCAATATACTCTTGGCAAAGGGTGGTTACAC GAAT CCAAAAG 
CGTGGAACT ATGTTAAAAAG CTACAACATAATATTAATG CTATCAAAT CT 
T CTAG CT CTT CAGAAGTTTATCAAT CAGTTG CAGAAGGAAAAATGATTGT 
GGGGTTGACTTACGAAGAC C CT AGTGT CAATT t GCAAAAAAGTGGTG CCA 
ATGTTTCTATTGTATAT C CG ACAGAAGGG ACAGTTTTTGT CCCAT CTT CG 
GTTGCAATTATAAAGAATGCTCCTTCTATGAAAGAAGCAAAGTTAT^ 
TAATTTT ATG CITTCTTTAGATGTTCAAAATG C CTTTGGG CAGTCAACGA 
GTAACCGAC CTATTCGTAAAGATG C CCAAACGAGTAATGG CATGAAAG CT 
TT AAAGGATATTGCTACT CTTAAAG AAGATTATCG CTATGTCACTAAG CA 
TAAGGGCCAAATCCTTAAAACCTATAATCGTATTCGTAGAAATGCTGAT 

SEQ ID NO. 6011 
STRAIN JM9 1130013 

CAGC CTT CT AAACTACTT C CAC CAAAAGAATTAGT 

TATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGAAC^ 

TTGAGGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGG 

CAACT AATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATGT 

TTTCTTTGG AGGAAA.TTATACG CAATTTG AAAGTCATAAGG CATTGTTTG 

AGTCITACGTATCAAAGAATGTTCATACTGTTATTCCAGA 

CCGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGT 

AAAT AACGAATTAG CTAAGGGACTTAC CAT CAAGAGTTATGAAG ATTTAT 

TACAGCCTTCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGAAT 

T CTAGTG CTTT CTCACAACT CAC CAATAT ACT CTTGG CAAAGGGTGGTTA 

CACCAAT C CAAAAGCGTGG AACTATGTTAAAAAG CTACAACATAATATTA 

ATGCTAT CAAAT CTT CTAG CTCTT CAG AAGTTTAT CAAT CAGTTG CAGAA 

GGCAAAATGATTGTGGGGCTGACTTACGAAGACCC^ 

AAAAAGTGGTGCCAATGTTTCTATTGTGTATCCGACAGAAGGGACAGTTT 

TTGTCCCATCTTCGGTTGCAATTATAAAC4AATGCTCCTTCT 

GCAAAGTTATTTATTAATTTTATGCTTTCTTTAGA 

TGGGCAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACGAGTA 
ATGG CAT GAAAG CTTTAAAGGATATTG CTACTCTTAAAG AAGATTAT CG C 
TATGT CACTAAG CATAAGGG C CAAATCCTT AAAAC CTAT AAT CGTATTCG 
TAGAAATG CTGAT 

PRETTY of: /biotmp/msa523010 .2{*} April 28, 2003 08:55 .. 
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1 50 

msa523 010 .2{263_COHl} 

msa523010 .2 {263JVT732} ~ 

msa523010 .2{263_M781} 

msa523010 .2{263_A909) — . 

msa523010 .2{263_H36B} 

msa523010 .2{263_090} 

msa523010 .2{263_18RS2l} — 

msa523010 .2{263_2603} atgaaagaaa aacagtcgaa aaggcttatt tatatactac tggttgtttc 

msa523010 .2{263_CJB110} 

msa52301.0.2{2S3_1169NT} 

msa523010.2{263_JM91130013} 

Consensus ********** ********** ********** ********** ********** 

51 100 

msa523010 . 2 { 263_COHl} cag ccttctaaac 

msa523010 .2{263_M732} — cag ccttctaaac 

msa523010 . 2 {263_M78l} cag ccttctaaac 

msa523010 .2{263_A909} cag ccttctaaac 

msa523010.2{263_H36B} taaac 

msa523010.2{263_090} . cag ccttctaaac 

msa523010.2{263_18RS2l} cag ccttctaaac 

msa523010 ,2{263_2603} cattattttt ataagtgttt ttacatacag tattagccag ccttctaaac 

msa523010 .2 f 263_CJB110} cag ccttttaaac 

msa523010.2 {263_1169NT} 

msa523010 . 2 { 263_JM91130013} cag ccttctaaac 

Consensus ********** ********** ********** ******* _ 



msa523010 .2{263_C0H1} 
msa523010 .2{263_M732) 
msa523010.2{263_M781> 
msa523010.2f263_A909} 
msa523010.2{263_H3 6B} 
msa523010.2{263_090} 
msa523010.2{263_18RS2l] 
msa523010.2{263_2603 
msa523010.2{263_CJB110] 
msa523010.2{263_1169NT] 
msa523010.2{263_JM91130013] 
Consensus 



101 

tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 
tacttccacc 



aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 
aaaagaatta 



gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 
gttattctaa 



tacttccacc aaaagaatta gttattctaa 



gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 
gtccaaATAG 

ATAG 

gtccaaATAG 



150 

TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 
TCAAGCCATT 



..**** ********** 



151 200 

msa523010.2{263_COHl} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010.2{263_M732} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa52 3 0 1 0 . 2 { 2 63_M7 8 1 } TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010 .2(263_A909> TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010 -2{263_H36B) TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa52301 0.2 {263^090} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010.2{263_18RS2l} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010 .2{263__2603 J TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010 .2 {263_CJB110} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

rasa 523 010 . 2 {263_1169NT} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

msa523010 .2{263__JM91130013} TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA 

Consensus ********** ********** ********** ********** ********** 



msa523010 
msa523010 
msa523010 
msa523010 
msa523010 
msa52301 
msa523010 .2 
rasa523010 
msa523010.2 
msa523010 .2 
msa523010.2{263 



.2{263_COHl 
.2f263_M732 
.2{263__M78l' 
.2{263__A909 
.2{263_H36B} 
0.2{263_090) 
{263_18RS21} 
.2{263_2603} 
{263_CJB110J 
{263_1169NTJ 
_JM91130013) 
Consensus 



201 

GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 
' GCTTATTCAA 
GCTTATTCAA 
GCTTATTCAA 



GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 
GGTGGGACAG 



GgCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 
GtCAACTAAT 
GtCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 
GgCAACTAAT 



AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 
AGATAGATTA 



250 

AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 
AGTAAGGAGG 



********** ********** *-******** ********** ********** 



msa523010 .2{263_COHl} 
msa523010.2{263JM732] 
msa523010.2(263_M78l) 
msa523010.2(263_A909J 
msa523010 .2{263_H36B} 
msa523010.2{263_090] 
msa523010 . 2 {263_18RS21 ) 
msa523010.2{263_2603} 
msa523010.2{263_CJB110} 
ltisa523010 . 2{263_1169NT} 
msa523010 . 2 { 263_JM91130013) 
Consensus 



251 

GTAAGCAgTT 
GTAAG CAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAGCAgTT 
GTAAG CA t TT 
GTAAGCAgTT 
*******_** 



GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
GAAGGCGGAT 
********** 



aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
aTTTTCTTTG 
gTTTTCTTTG 
_********* 



GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
GAGGAAATTA 
********** 



300 

TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
TACGCAATTT 
********** 



928 
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Table 60: Comparative Sequences relating to SAG1945 



301 350 

msa523010 .2{263_COHl} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa523010 .2{263_M732} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa523010 .2|263_M78l| GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa523010 .2{263_A90 9} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATaTTCATAC 

msa523010.2{263_H36B} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATaTTCATAC 

msa523010.2{263_090} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa523 010.2{263_18RS2l} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa 523010. 2 { 263^2603} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa523 010.2{263_CJB110} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa52 3 010.2{263_1169NT} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

msa52 3 0 1 0 . 2 { 2 63_JM9 1130013} GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC 

Consensus ********** ********** ********** ********** **_******* 



msa523010 
msa523010 
msa523010 
msa523010 
msa523010 
msa52301 
msa523010.2 
msa523010 
msa523010 .2 
msa523010 .2 
msa523010 .2(263 



.2{263_COHl} 
.2{263_M732} 
-2{263_M78l} 
.21 263_A90 9} 
.2{263_H36B} 
0.2{263_090} 
{263 18RS21} 
.2{263_2603} 
{263_CJB110} 
{263__1169NT} 
JM91130013} 
Consensus 



351 

TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
TGTTATTCCA 
********** 



GAcTATATCC 
GAcTATATCC 
GAcTATATCC 
GAtTATATCC 
GAtTATATCC 
GAcTATATCC 
GAcTATATCC 
GAcTATATCC 
GAcTATATCC 
GAcTATATCC 
GAcTATATCC 
**_******* 



ATCCgAGTGA 
ATC CgAGTGA 
ATCCgAGTGA 
ATCCgAGTGA 
ATCCgAGTGA 
ATCCaAGTGA 
ATCCaAGTGA 
ATCCaAGTGA 
ATCCaAGTGA 
ATCCaAGTGA 
ATCCgAGTGA 
****_***** 



TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
TACGGCGACA 
********** 



400 

CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
CCTTATACTA 
********** 



401 450 

rasa523010.2{263_COHl} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010.2f 263_M732} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010.2{263_M78l} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010.2{263_A909} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010 .2{2 63_H36B} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGtTAA GGGACTTACC 

ms a 523010 - 2 { 263_090} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010.2{263_18RS21) TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010.2{263_2603} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523 010.2{263_CJB110} TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523 010.2{263_1169NT) TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

msa523010 .2 {263_JM91130013 } TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC 

Consensus ********** ********** ********** ******_*** ********** 



msa523010.2{263_COHl] 
msa523010.2{263_M732; 
msa523010.2{263_M78lj 
msa523010.2{263_A909] 
msa523010.2{263_H36B] 
msa523010 . 2 {263_090] 
msa523 010.2{263_18RS2l] 
msa523010.2{263_2603} 
msa523 010 . 2 {263_CJB110 } 
msa523010 . 2 {263_1169NT} 
msa523010 . 2 { 263_JM91130013 } 
Consensus 



451 

ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 
ATCAAGAGTT 



ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 
ATGAAGATTT 



********** ********** 



ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
ATTACAGCCT 
********** 



TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
TCCTTAAAAG 
********** 



500 

GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
GTAAAATTGC 
********** 



501 550 

msa523010 . 2 { 263_COHl } CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010 .2f 263_M732} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010 .2{263__M781} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010 . 2 {263_A909 } CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010 .2{263_H36B> CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010.2{263_090} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523 010.2{263_18RS2l} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010 .2 {263_2603 } CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010.2{263_CJB110} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA 

msa523010.2{263_1169NT} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACcAATA 

msa523010 .2 {263_JM9 1130013} CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACcAATA 

Consensus ********** ********** ********** ********** *****_**** 



551 600 

msa523010.2{263_COHl} TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{2'63_M732> TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{2 63_M781} TACTCTTGGC CAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{263__A909> TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010 .2{2 63_H36B} TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{263_090} TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523 010.2{263_18RS21} TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010 .2{263_2603} TACTCTTGGC CAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523 010.2(263_CJB110} TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{263_1169NT} TACTCTTGGC aAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 

msa523010.2{263_JM91130013} TACTCTTGGC aAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT 



929 
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Table 60: Comparative Sequences relating to SAG1945 



Consensus 



********** 



.********* ********** ********** ********** 



msa523010 .2{263_COHl" 
msa523010 .2f263_M732; 
msa523010 .2(263_M78l' 
msa523 010 . 2 { 263_A9 0 9 ] 
msa523010.2{263_H36B} 
msa523010.2{263_090 T 
msa523010.2{263_18RS2l] 
msa523010.2{263_2603; 
msa523 0 1 0 . 2 { 2 63_CJB1 1 0 ] 
msa523010.2{263_1169NT] 
msa523010.2{263_JM91130013; 

Consensus 



601 

AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 
AAAAAGCTAC 



AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 
AACATAATAT 



********** ********** 



TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
TAATGCTATC 
********** 



AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 
AAATCTTCTA 



650 

GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 
GCTCTTCAGA 



********** ********** 



651 700 

rasa523010 .2{263_COHl} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010.2{263_M732} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010.2{263_M78l} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010.2{263_A909} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010 .2{263_H36B} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010.2{263_090} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG 

ms a52 3 0 1 0 . 2 { 2 63_1 8 RS 2 1 ) AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG 

msa523010 .2{263_2603) AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG 

msa523010 . 2 {263_CJB110} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG 

msa523010.2{263_1169NT} AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG 

msa523010.2{263_JM9113 0013} AGTTTATCAA TCAGTTGCAG AAGGcAAAAT GATTGTGGGG cTGACTTACG 

Consensus ********** ********** ****_***** ********** _********* 

701 750 

msa523010.2f263_COHl> AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_M732) AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_M78l} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010 .2 {263_A909} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_H36B} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523 010. 2 {263^090} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_18RS2l} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523 010.2{263_2603) AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_CJB110) AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010.2{263_1169NT} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa 

msa523010 . 2{263_JM91130013} AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTg 

Consensus ********** ********** ********** ********** *********_ 



751 

msa523 010 .2{263_COHl} TAcCCGACAG AAGGGACAGT 

msa523010.2(263_M732) TAcCCGACAG AAGGGACAGT 

msa523010.2{263_M78l} TAcCCGACAG AAGGGACAGT 

msa523010.2{263_A909} TAt CCGACAG AAGGGACAGT 

msa523010.2{263__H36B} TAt CCGACAG AAGGGACAGT 

msa523010.2{263_090} TAt CCGACAG AAGGGACAGT 

msa523010.2{263_18RS2l} TAt CCGACAG AAGGGACAGT 

msa523010.2{263_2603} TAt CCGACAG AAGGGACAGT 

msa523010.2{263_CJB110} TAt CCGACAG AAGGGACAGT 

msa523010.2{263_1169NT) TAt CCGACAG AAGGGACAGT 

msa523 010 .2 {263_JM9 1130013} TAt CCGACAG AAGGGACAGT 

Consensus **-******* ********** 



TTTTGTCCCA 
TTTTGTCCCA 
TTTTGTCCCA 
TTTTGTCCCA' 

TTTTGTCCCA 
TTTTGTCCCA 
TTTTGTCCCA 



TTTTGTCCCA 
TTTTGTCCCA 
********** 



TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
TCTTCGGTTG 
********** 



800 

CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
CAATTATAAA 
********** 



801 850 

msa523010 .2 {263_COHl} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_M732} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_M78l} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_A909) GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

m&a523 010 .2{263__H36B} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010 . 2 { 263__090 } GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_18RS2l| GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010 .2{263_2603} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010 . 2 {263_CJB110 } GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_1169NT} GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

msa523010.2{263_JM91130013) GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT 

Consensus ********** ********** ********** ********** ********** 



msa523010 . 

msa523010 . 

msa523010 . 

msa523010. 

msa523010. 
msa523010 
msa523010.2{ 

mBa5 23010 . 
msa523010 
msa523010 



263_COHl) 
263_M732} 
263_M781} 
263_A909} 
263_H36B) 
2{263__090) 
263__18RS2l) 
x W .2{263_2603 J 
.2{263_CJB110) 
.2{263_1169NT} 



851 

CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 
CTTTAGATGT 



TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 
TCAAAATGCC 



TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 
TTTGGGCAGT 



CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 
CAACGAGTAA 



900 

CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 
CCGACCTATT 



930 



WO 2004/018646 
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Table 60: Comparative Sequences relating to SAG1945 



msa523010 . 2 { 263_JM91130013 } 
Consensus 



CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT 
********** ********** ********** ********** ********** 



msa523010 
msa523010 
msa523010 
msa523010 
rasa523010 
msa52301 
msa523010 .2 
msa523010 
msa523010.2 
msa523010 .2 
msa523010.2{263. 



.2{263_COHl} 
.2{263_M732} 
.2{263_M781} 
.2{263_A909} 
.2{263_H36B} 
0.2{263_090) 
{263_18RS2l} 
2{263_2603} 
263_CJB110} 
263_1169NT) 
i^JM91130013} 
Consensus 



901 

CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 
CGTAAAGATG 



CCCAAACaAG 
CCCAAACaAG 
CCCAAACaAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 
CCCAAACgAG 



TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 
TAATGGCATG 



********** *******_** ********** 



AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
AAAGCTTTAA 
********** 



950 

AGGATATcGC 
AGGATATcGC 
AGGATATcGC 
AGGATAT t GC 
AGGATAT tGC 
AGGATAT tGC 
AGGATAT tG C 
AGGATAT tGC 
AGGATAT tGC 
AGGATATtGC 
AGGATAT tGC 
*******_** 



msa523010 
msa523010 
msa523010 
msa523010 
msa523010 
niBa52301 
msa523010 .2 
msa523010 
msa523010.2 
msa523010 .2 
msa523010.2{263 



.2{263_COHl} 
-2{263_M732} 
.2{263_M78l} 
,2{263_A909} 
.2{263_H36B} 
0.2{263_090) 
{263_18RS2l} 
.2{263_2603} 
{263_CJB110} 
{263_1169NT> 
_JM91130013} 
Consensus 



951 

TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
TACTCTTAAA 
********** 



GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
********** 



GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
GCTATGTCAC 
********** 



TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
TAAGCATAAG 
********** 



1000 
aGCCAAATCC 
aGCCAAATCC 
aGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
gGCCAAATCC 
_********* 



1001 1035 

msa523010 .2{263_COHl} TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT 

tnsa523010 .2{263_M732} TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT 

msa523010 -2{263_M7 8l} TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT 

rasa523010 .2{263_A909) TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010 .2{263_H36B) TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

ntsa523010 . 2{263_090} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010.2{263_18RS2l} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010 .2{263_2€03} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010 .2{263_CJB110} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010 .2{263_1169NT} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

msa523010.2{263_JM91130013} TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT 

Consensus ********** ******_*** ********** ***** 



SEQ ID NO. 6012 
STRAIN 2603 frame: 1 

MKEKQSKRL I YI LLWS 1 1 FI S VFT YS I SQPSKLLPPKELVILSPNSQAI LTGTI PAFEE 
KYGIKVKLI OGGTGQL I DRLSKEGKQLKAD I FFGGNYTQFESHKALFE S YVS KNVHTVI P 
DY I HPSDTATP YTI NGS VL I VNNELAKGLT I KSYEDLLQPSLKGKIAFADPNTSSSAFSQ 
LTN I LIjAKGG YTNP KAWNYVKKLQHN I NA I KS SS S SEVYQSVAEGKM I VGLT YED PS VNL 
QKSGANVSI VYPTEGTVFVPSSVAI IKNAPSMKEAKLFINFMLSLDVQN^ 
RKDAOTSNGMKALKD I ATLKEDYRYVTKHKGQ I LKTYNRI RRNAD 

SEQ ID NO. 6013 
STRAIN 090 frame: 1 

QPSKLLPPKELVI LSPNSQAILTGT I PAFEE KYGIKVKLI QGGTGQLIDRLSKEGKQLKA 
DI FFGGNYTQFESHKALFESYVSKNVHTVI PDYIHPSDTATPYTINGSVLIVNNELAKGL 
TI KS YEDLLQPSIiKGKI AFADPNTSSSAFSQIiTNI LLAKGG YTN P KAWNYVKKLQHN I NA 
I KSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAOTSNGMKALKD I ATLKEDYRYVTKH 
KGQI LKTYNRI RRNAD 

SEQ ID NO. 6014 
STRAIN A909 frame: 1 

QPSKLLPPKELVI LS PNSQAI LTGT I PAFEEKYGI KVKLIQGGTGQL I DRLSKEGKQLKA 
DI FFGGNYTQFE SHKALFES YVSKN I HTVT PDYIHPSDTATPYTINGSVLIVNNELAKGL 
TI KSYEDLLQPSLKGKI AFADPNTS S S AF S QLTN I LLAKGG YTN P KA WN YVKKL QHN I NA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLFINFMLSLDVQNAFGQSTSNRP I RKDAOTSNGMKALKD I ATLKEDYRYVTKH 
KGQI LKTYNRI RRNAD 

SEQ ID NO. 6015 

STRAIN H36Bframe:2 

KLLPPKELVILSPNSQAILTGTI PAFEEKYGI KVKLI QGGTGQLI DRLSKEGKQLKADI F 
FGGNYTQFESHKALFESYVSKNIHTVI PDYIHPSDTATPYTINGSVLIVNNELVKGLTIK 
SYEDLI^PSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNIN^ 
SSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNAPSM 
KEAKLF I NFMLSLDVQNAFGQSTSNRP I RKDAOTSNGMKALKD I ATLKEDYRYVTKHKGQ 
I LKTYNRI RRNAD 



SEQ ID NO. 6016 
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STRAIN 18RS21 frame: 1 

QPSKLLP PKELVI LSPNSQAI LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLSKEGKQLKA 
DI FFGGNYTQFESHKALFESYVSKNVHTVI PDYI HPSDTATPYTINGSVLIVNNELAKGL 
TI KS YEDLLQPSLKGKI AFADPNTSSSAFSQLTNI LLAKGGYTN P KAWNYVKKLQHN I NA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQTSNGMKALKDI ATLKEDYRYVTKH 
KGQI LKTYNRI RRNAD 

SEQ ID NO. 6017 
STRAIN M732 frame: 1 

QPSKLLPPKELVILS PNSQAI LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLSKEGKQLKA 
DI FFGGNYTQFESHKALFES YVSKNVHTVI PDYI HPSDTATP YTI NGSVLI VNNELAKGL 
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS IVY PTEGTVFVPS SVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQTSNGMKALKD I ATLKEDYRYVTKH 
KSQI LKTYNRI RRNAD 

SEQ ID NO . 6018 
STRAIN COH1 frame: 1 

QPSKLLP PKELVI LSPNSQAI LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLSKEGKQLKA 
DI FFGGNYTQFESHKALFES YVSKNVHTVI PDYI HPSDTATP YTI NGSVLI VNNELAKGL 
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA 
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI IKNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQT SNGMKALKD I ATLKEDYRYVTKH 
KSQI LKTYNRI RRNAD 

SEQ ID NO. 6019 

STRAIN M781 frame: 1 

QPSKLLP PKELVI LS PNSQA I LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLS KEGKQLKA 
DI FFGGNYTQFESHKALFES YVSKNVHTVI PDYI HPSDTATPYTINGSVLIVNNELAKGL 
TI KS YEDLLQPSLKGKI AFADPNTS S SAF SQLTN I LLAKGGYTNP KAWNYVKKLQHNINA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQTSNGMKALKDI ATLKEDYRYVTKH 
KSQI LKTYNR I RRNAD 

SEQ ID NO. 6020 

STRAIN CJB110 frame: 1 

QPFKLLPPKELVI LS PNSQAI LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLSKEGKQLKA 
DI FFGGNYTQFESHKALFES YVSKNVHTVI PDYI HPSDTATP YT I NGSVL I VNNELAKGL 
T IKS YEDLLQPSLKGKI AFADPNTS S SAF SQLTN I LLAKGGYTNPKAWNYVKKLQHN I NA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQTSNGMKALKDI ATLKEDYRYVTKH 
KGQI LKTYNR I RRNAD 

SEQ ID NO. 6021 

STRAIN 1 169NT frame: 3 

SQAI LTGT I PAFEEKYG I KVKL I QGGTGQL I DRLS KEGKHLKAD I FFGGNYTQFE SHKAL 
FE S YVS KNVHTV I PDYI HPSDTATP YTINGSVL I VNNELAKGLTI KS YEDLLQPSLKGKI 
AFADPNTSSSAFSQLTNI LLAKGGYTNPKAWNYVKKLQHN 

MI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNAPSMKEAKLFINFMLSLD 
VQNAFGQSTSNRP I RKDAQT SNGMKALKD I ATLKED YRYVTKHKGQ I LKTYNRI RRNAD 

SEQ ID NO. 6022 
STRAIN JM91 130013 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA 
DVFFGGNYTQFESHKALFES YVSKNVHTVI PDYI HPSDTATP YT I NGSVL I VNNELAKGL 
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA 
IKSSSSSEVYQSVAEGKMI VGLTYEDPSVNLQKSGANVS I VYPTEGTVFVPSSVAI I KNA 
PSMKEAKLF INFMLSLDVQNAFGQSTSNRP I RKDAQTSNGMKALKDI ATLKEDYRYVTKH 
KGQ I LKTYNRI RRNAD 

PRETTY of : /biotmp/msa523117 . 2 { *} April 28, 2003 08:56 .. 

! 50 

msa523117.2{263_COHl} q pskllppkel vilspnSQAI 

msa523117.2{263_M732} q pskllppkel vilspnSQAI 

msa523117.2{263_M78l} ~q pskllppkel vilspnSQAI 

msa523117 . 2 {263_1169NT} ~ 7~~~7~7 ™J 

msa523117.2{263 CJBllO} q pfkllppkel vilspnSQAI 

msa523117.2{263 090} q pskllppkel vilspnSQAI 

msa523117.2{263 18RS21) q pskllppkel vilspnSQAI 

msa523117.2{263 2603} mkekqskrli yillwsiif isvftysisq pskllppkel vilspnSQAI 

msa523117.2{263>909) q pskllppkel vilspnSQAI 

msa523117.2{263 JM91130013) q pskllppkel vilspnSQAI 

msa523117T2{263 H36B} kllppkel vilspnSQAI 

Consensus ********** ********** *********- **** 

51 100 
msa523117 .2(263 COHl} LTGT I PAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF 
msa523117.2{263_M732} LTGT I PAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF 
msa523117 .2(263 M78l} LTGT I PAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF 
msa523117. 2(263 1169NT} LTGTI PAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKhLKAD iFFGGNYTQF 
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msa523117.2{263_CJB110} LTGTI PAFEE 

msa523117.2{263_090} LTGTI PAPEE 

msa523117.2{263_18RS2l} LTGTI PAFEE 

msa523117 .2 {263^2603} LTGTI PAFEE 

msa523117.2{263_A909) LTGTI PAFEE 

msa523117.2{263__JM9113 0013} LTGTI PAFEE 

msa523117.2{263_H36B} LTGTI PAFEE 
Consensus 



KYGIKVKLIQ 
KYGIKVKLIQ 
KYGIKVKLIQ 
KYGIKVKLIQ 
KYGIKVKLIQ 
KYGIKVKLIQ 
KYGIKVKLIQ 



GGTGQLIDRL 
GGTGQLIDRL 
GGTGQLIDRL 
GGTGQLIDRL 
GGTGQLIDRL 
GGTGQLIDRL 
GGTGQLIDRL 



S KEGKqLKAD 
SKEGKqLKAD 
S KEGKqLKAD 
SKEGKqLKAD 
SKEGKqLKAD 
SKEGKqLKAD 
SKEGKqLKAD 



i F FGGNYTQF 
iFFGGNYTQF 
i FFGGNYTQF 
iFFGGNYTQF 
iFFGGNYTQF 
vF FGGNYTQF 
iFFGGNYTQF 



********** ********** ********** *****-**** _********* 



msa523117.2{263_COHl] 
msa523117 .2{263_M732; 
msa523117.2{263_M78lJ 
msa523117 . 2 f 263_JL169NT 
msa523117.2{263__CJB110; 

msa523117 . 2 { 263_090) 
msa523117.2{263_18RS2l} 
msa523117 .2{263_2603} 
msa523117.2{263_A909} 
msa523117.2{263_JM91130013> 
msa523117.2{263_H36B} 
Consensus 



101 

ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 



VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNiHTVIP 
VSKNVHTVIP 
VSKNiHTVIP 



********** ****-***** 



DYIHPSDTAT 
DY I HPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
********** 



PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
PYTINGSVLI 
********** 



151 

msa523117.2{263_COHl} IKSYEDLLQP 

msa523117.2(263_M732} IKSYEDLLQP 

msa523117 .2 {263_M78l} IKSYEDLLQP 

msa523117.2f263_1169NT) IKSYEDLLQP 

msa523117.2{263_CJB110> IKSYEDLLQP 

msa523117.2{263_090} IKSYEDLLQP 

msa523117.2{263_18RS2l} IKSYEDLLQP 

msa523117 .2{263_2603> IKSYEDLLQP 

msa523117.2{263_A909} IKSYEDLLQP 

msa523117 . 2 (263_JM91130013 } IKSYEDLLQP 

msa523117.2{263_H36B} IKSYEDLLQP 
Consensus 



SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 



PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 



********** ********** ********** 



msa523117.2(263_COHl> 
rasa523117 . 2 {263_M732} 
msa523117 . 2 {263_M781} 
rasa523117.2{263_1169NT} 
msa523 117 . 2 { 2 63_CJB110 } 
msa523117 . 2{263_090} 
msa523117.2{263_18RS2l] 
tnsa523117.2{263_2603] 
msa523117.2{263_A909] 
msa523117.2{263_JM91130013l 
msa52 3 1 1 7 . 2 { 2 6 3_H3 6B ] 
Consensus 



msa523117.2(263_COHl} 
msa523 117 . 2 f 263_M732 } 
msa523117.2{263_M78lJ 
msa523117 . 2 {263_1169NT} 
msa523117.2{263_CJB110} 
moa523117-2{263_090} 
msa523117.2{263_18RS21* 
msa523117.2{263_2603 
msa52 3117.2(2 63_A9 0 9 
msa523117 . 2 { 263_JM91130013 
msa523 117 .2(2 63_H36B 
Consensus 



msa523117 .2{263_COHl) 
msa523117.2(263_M732} 
tnsa523117 ,2{263_M781} 
msa523117.2{263_1169NT) 
msa523117 . 2 {263_CJB110 } 
msa523117. 2 {263^090} 
msa523117.2{263_18RS2l} 
msa523117 .2{263_2603> 
msa523117.2{263_A909} 
msa523117.2{263_JM91130013} 
msa523117 , 2 {263_H36B} 
Consensus 



201 

KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
********** 

251 

YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 
YPTEGTVFVP 



KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 



SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 



LTN I LLAKGG 
LTNILLAKGG 
LTN I LLAKGG 
LTNILLAKGG 
LTN I LLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
********** 



LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 



150 

VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELaKGLT 
VNNELvKGLT 
*****_**** 

200 

YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
********** 

250 

QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 



********** ********** ********** ********** 



SSVAIIKNAP 
SSVAIIKNAP 
SSVAIIKNAP 
SSVAI I KNAP 
SSVAIIKNAP 
SSVAIIKNAP 
SSVAIIKNAP 
SSVAIIKNAP 
SSVAI I KNAP 
SSVAIIKNAP 
SSVAIIKNAP 



SMKEAKLF I N 
SMKEAKLFIN 
SMKEAKLF I N 
SMKEAKLF IN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 



********** ********** ********** 



FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
********** 



301 

RKDAQTSNGM 
RKDAQT SNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
********** 



KALKDIATLK 
KALKD I ATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 



EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 
EDYRYVTKHK 



SQILKTYNRI 
SQILKTYNRI 
SQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 
gQILKTYNRI 



300 

FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
********** 

345 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 
RRNAD 



********** ********** -********* ***** 
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SEQ ID NO. 6101 
STRAIN 2603 

ATGGT AAAAGTTAGTGTAAGTTCTGTAGGAA.CTCAAG CAT CAACAGT AG CTATTT CTATG 
TTTAGT CGTGTAT CGG CTTT AAATG ATG CAATAACAAAACT AT CAT CTTTTG CAGAGGCT 
GCAACTCTT CAAGGGACTG CTT ATT CAAATG CAAAAAG CTATGCTACTGG AACGTTAACT 
CCGATG CTT CAAGGAATGATTCTTTT CT CTGAAACATTG AGTGAGAAATGTACAG AATT A 
CAAACCTTATAT GT CTCAATTTGTGGTGATGAGGATTTAG ACT CTGT CGTTTTAGAATCA 
AAATT AG CAAGTGATAG GG CAT CATTAAAG ATTG CTGAAG CACTTTT AG AG CAT CTT AAC 
GATGATC CAGAACCTT C CAAAT CT GC CATAAGTT CTACAAAAAGTAAT ATTAAAAAATTA 
AAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAATTTAACGCCCAT 
TCAG CAACAGTATTTG C GG ACATTT CTAATG CACAGT CAACTGTT AAC CAAGCACTAG CG 
GCrcTTTCAACAGGATTTTCTGGATATAATAGTAAAACCGGAGCT 

T C CGGACAG ATGGAATG GACAAAG ACAGTTAAGAAGAATTGG AAAGAGCG AG AAG ACG C C 
AAAG CTGAAGAACTG AAAAGTAAAAAG G C TGAAG AAAGTAAG AAAGCTT CAAAAATT GAA 
AATACTACT AAAAAAAGTAATGTTT CAGTTGATAAAAAG AAATT AATAAAAG CGG CTAAT 
GAAG CGTATAAATTAGG AG AAATT AAAAAAGATAC CTATGAAT CAATTATCAGTGGTTTA 
AGTAATG CAT CGG CTGC CTTACTTAAAGAGGT AG CTAAAT CAAAATTGACTGACACAGCT 
CGGCTATTGATG 

SEQ XD NO. 6102 
STRAIN 090 

TTAAATGATG CAAT AACAAAACTAT CAT CTTTTG CAGAGGCT 
G G^CTCTT CAAGGGACTG CTTATT CAAATGCAAAAAG CT ATGCTAC TG G 
AACGTTAACT CCGATGCTTCAAGGAATGAITCTT'ITCTCTGAAACATTGA 
GTGAGAAATGTACAGAATTACAAACCTTATATGT CT CAATTTGTGGTGAT 
GAGGATTTAGACT CTGTCGTTTTAGAATCAAAATTAG CAAGTGATAGGG C 
AT CATTAAAGATTG CTGAAG CACTTTTAGAG CAT CTTAACGATGATC CAG 
AACCTTC CAAAT CTGC GATAAGTTGTACAAAAAGTAATATTAAAAAATT A 
AAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAATT 
TAACG CCCATT CAG CAACAGTATTTGCGGACATTT CTAATGCACAGT CAA 
CTGTTAACCAAG CACTAGCGGCTGTTT CAACAGGATTTT CTGGATATAAT 
AGTAAAACCGGAGCTTTTGGAAAAC CAACATC CGGACAGATGGAATGGAC 
AAAGACAGTTAAGAAGAATTGGAAAGAG CGAGAAGACGC CAAAG CTGAAG 
AACTGAAAAGTAAAAAGGCTGAAGAAAGTAAGAAAGCTTCAAAAATTGAA 
AATACTACTAAAAAAAGTAATGTTT CAGTTGATAAAAAGAAATT AAT AAA 
AG CGGCTAATGAAG CGTATAAATT AGGAGAAATTAAAAAAGATAC CTATG 
AATCAATTAT CAGTGGTTT AAGTAATG CAT CGG CTGC CTTACTTAAAGAG 
GTAG CTAAAT CAAAATTGACTGACACAGCTCGGCTATTGATG 

SEQ ID NO. 6103 

STRAIN 18RS21 

TTAAATGATG CAAT AACAAAACTAT CAT CTTTTG CAG AGG C 
TGCAACT CTT CAAGGGACT G CTTATT CAAATG CAAAAAG CTATG CTACTG 
GAACGTTAACTCCGATGCTTCAAGGAATGATTCTTTTCT 
AGTGAGAAAT GTACAGAATT ACAAAC CTTATATGTCT CAATTTGTGGTGA 
TGAGGATTTAGACTCTGTCGTTTTAGAAT CAAAATTAG CAAGTGATAGGG 
CATCATTAAAGATTG CTGAAG CACTTTTAGAG CAT CTTAACGATGATCCA 
GAACCTTCCAAATCTG C CAT AAGTTCTACAAAAAGTAATATT AAAAAATT 
AAAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAAT 
TTAACGCC CATT CAG CAACAGTATTTGCGGACATTTCTAATG CACAGTCA 
ACTGTTAACCAAGCACTAGCGGCTGTTTCAACAGGAITTT'CTGGATATAA 
TAGTAAAACCG<3AGCITTTGGAAAAC CAACAT CCGGACAGATGGAATGG A 
CAAAGAC^GT^AAGAAGAATTCGAAAGAGCGAG 

GAACTGAAAAGTAAAAAGGCTGAAGAAAGTAAGAAAG CTT CAAAAATTG A 
AAAT ACTACTAAAAAAAGTAATGTTT CAGTTGATAAAAAGAAATT AATAA 
AAGCGGCTAATGAAGCGTATAAATTAGGAGAAATTAAAAAAGATACCTAT 
GAATCAATTATCAGTGGTTTAAGTAATGCATCGGCTGC 
GGTAG CTAAAT CAAAATTG ACTGACACAG CTCGG CTATTG ATG 

PRETTY of : /biotmp/msal85066 . 2 { * } May 13, 2003 07:01 

1 50 

msal85066.2{270_090} 

msal85066.2{270_18RS2l) ~ 

msa!85066.2{270 2603} atggtaaaag ttagtgtaag ttctgtagga actcaagcat caacagtagc 
Consensus ********** ********** ********** ********** ********** 

51 100 

msal85066.2{270 090} TT AAATGATGCA ATAACAAAAC 

msal85066.2{270 18RS21} TT AAATGATGCA ATAACAAAAC 

msal85066.2{270 2603} tatttctatg tttagtcgtg tatcggctTT AAATGATGCA ATAACAAAAC 
Consensus ********** ********** ********** ********** ********** 

101 150 
msal85066.2{270 090} TAT CATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TT ATT CAAAT 
msal85066.2{270 18RS21) TATCATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TTATT CAAAT 
msal85066. 2(270 2603} TATCATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TTATT CAAAT 
********** ********** ********** ********** ********** 



Consensus 



151 



200 



msal85066'.2{270 090} GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATG CTTC AAGGAATGAT 
msal85066.2{270 18RS21} GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATG CTTC AAGGAATGAT 
msa!85066.2{270 2603} GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATGCTTC AAGGAATGAT 
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Consensus 



msal85066 .2{270_090} 
msal85066.2{270_18RS2l} 
msal85066 . 2 {270_2603 } 
Consensus 



msal85066 - 2 {270J390 } 
msal85066.2{270_JL8RS2l} 
msal85066.2{270_2603} 
Consensus 



msal85066 .2{270_090> 
msal85066.2{270_18RS2l} 
msal85066. 2 {270^2603} 
Consensus 



msal85066.2{270_090} 
msal85066.2{270_18RS2l} 
msal85066.2{270_2603} 
Consensus 



msal85066 .2{270_090) 
msal85066.2{270_18RS2l) 
msal85066 . 2{ 270_2603 } 
Consensus 



msal85066.2{270_090} 
msal85066.2{270_18RS2l} 
msal8 5 0 6 6 . 2 { 2 7 0_2 6 03 } 
Consensus 



msal85066.2{270_090} 
msal85066 . 2 { 270_18RS2l} 
msal85066 . 2 {270_2603 } 
Consensus 



msal85066. 2 (270^090) 
msal85066.2{270_18RS2l) 
msal85066 . 2{ 270_2603 } 
Consensus 



msa!85066.2{270_090} 
msal85066.2{270_18RS2l} 
msal8S066 . 2{270_2603 } 
Consensus 



msal85066.2{270_090} 
msal85066 - 2 { 270_18RS21 ) 
msal85066.2{270_2603) 
Consensus 



msal85066.2{270_090} 
msal85066.2{270_18RS2l} 
msal85066 . 2 { 270J2603 } 
Consensus 



msal85066.2{270_090l 
msa!850 66. 2 { 27 0_18RS2l} 
msal85066. 2 {270_2603} 
Consensus 



msal85066.2{270_090> 
msal85066.2{270_18RS21J 
msal85066.2{270_2603) 
Consensus 



msal85066.2{270_090) 
msal85066.2{270_18RS2l} 



********** ********** ********** ********** ********** 

201 250 
TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT 
TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT 
TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT 
********** ********** ********** ********** ********** 

251 300 
ATGT CTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA 
ATGTCTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA 
ATGT CTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA 
********** ********** ********** ********** ********** 

301 350 
AAATT AG CAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA 
AAATTAG CAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA 
AAATTAG CAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA 
********** ********** ********** ********** ********** 

351 400 
GCATCTTAAC GATGAT CCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA 
GCAT CTTAAC GATGAT CCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA 
GCATCTTAAC GATGATCCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA 
********** ********** ********** ********** ********** 

401 450 
AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA 
AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA 
AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA 
********** ********** ********** ********** ********** 

451 500 
TTAGACAACC TTAATGAATT TAACG C C CAT TCAGCAACAG TATTTGCGGA 
TT AG ACAAC C TTAATGAATT TAACG CC CAT TCAGCAACAG TATTTGCGGA 
TTAGACAACC TTAATGAATT TAACG CC CAT TCAGCAACAG TATTTGCGGA 
********** ********** ********** ********** ********** 

501 550 
CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTG TTT CAA 
CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTGTTTCAA 
CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTGTTTCAA 
********** ********** ********** ********** ********** 

551 600 
CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA 
CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA 
CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA 
********** ********** ********** ********** ********** 

601 650 
TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG 
TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG 
TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG 
********** ********** ********** ********** ********** 

651 700 
AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA 
AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA 
AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA 
********** ********** ********** ********** ********** 

701 750 
AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT 
AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT 
AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT 
********** ********** ********** ********** ********** 

751 800 
GATAAAAAGA AATT AATAAA AG CGG CTAAT GAAGCGTATA AATTAGGAGA 
GATAAAAAGA AATTAATAAA AGCGG CTAAT GAAGCGTATA AATTAGGAGA 
GATAAAAAGA AATTAATAAA AGCGG CTAAT GAAGCGTATA AATTAGGAGA 
********** ********** ********** ********** ********** 

801 850 
AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT 
AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT 
AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT 
********** ********** ********** ********** ********** 

851 900 
CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT 
CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT 
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msal8 5066.2{270_2603} CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT 
Consensus ********** ********** ********** ********** ********** 

901 912 
msal85066.2{270_090} CGGCTATTGA TG 
msal85066.2{270_JL8RS2l} CGGCTATTGA TG 
msal850 66. 2 {270^2603} CGGCTATTGA TG 
Consensus ************ 

SEQ XD NO. 6104 

STRAIN 2603 frame: I 

MVKVS VS S VGTQASTVAI SMFS RVSALNDAITKLS S PAEAATLQGTAYSNAKS YATGTIiT 
PMLG^MILFSETLSEKCTELQTLYVSICGDEDLDSVVLESKIiASDRASLKIAEALLEHLN 
DDPEPSKSAI SSTKSNI KKLKKRI KSNQKKLDNLNEFNAHSATVFADI SNAQSTVNQALA 
AVSTGFSGYNSKTGAFGKPTSGQMEWTKTVKKNWKEREDAKAEELKSKKAEESKKASKIE 
NTTKKSNVS VDKKKLI KAANEAYKLGEIKKDTYESI I SGLSNASAALLKEVAKSKLTDTA 
RLLM 

SEQ ID NO. 6105 
STRAIN 090 frame: 1 

LNDAI TKLSS FAEAATLQGTAYSNAKS YATGTLT PMLQGM I LFSETLSEKCTELQTLYVS 
I CGDEDLDSWLESKLASDRASLKIAEALLEHLNDDPEPSKSAI SSTKSNI KKLKKRI KS 
NQKKLDNLNE FNAHSATVFAD I SNAQSTVNQALAAVSTGFSG YNS KTGAFGKPTSGQME W 
TKTVKKNWKEREDAKAEELKSKKAEESKKASKIENTTKKS NVSVDKKKLI KAANEAYKLG 
EIKKDTYES 1 1 SGLSNASAALLKEVAKSKLTDTARLLM 

SEQ ID NO. 6106 
STRAIN 18RS2I frame: 1 

LNDAI TKLS S FAEAATLQGTAYSNAKS YATGTLTPMLQGM I LFS ETLSE KCTELQTL YVS 
I CGDEDLDSWLESKLASDRASLKIAEALLEHLNDDPEPSKSAI SSTKSNI KKLKKRI KS 
NQKKLDNLNE FNAHSATVFADISNAQSTVNQALAAVSTGFSGYNSKTGAFGKPTSGQMEW 
TKTVKKNWKEREDAKAEELKSKKAEESKKASKIENTTKKSNVSTO 
E I KKDTYES 1 1 SGL SNASAALLKE VAKSKLTDTARLLM 

PRETTY of : /biotmp/msal85181 . 2 { * } May 13, 2003 07:03 

i 1 50 

msal85181.2{270_090} LNDA ITKLSSFAEA ATLQGTAYSN 

msal85181.2{270_18RS2l} LNDA ITKLSSFAEA ATLQGTAYSN 

msal85181 ,2{270_2603} mvkvsvssvg tqastvaism f srvsaLNDA ITKLSSFAEA ATLQGTAYSN 

Consensus ********** ********** ********** ********** ********** 

51 100 
msa!85 18 1.2 {270^090} AKS YATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSWLES 
msal85181.2{270_18RS2l} AKS YATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSWLES 
msal85181.2{270_2603} AKSYATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSWLES 
Consensus ********** ********** ********** ********** ********** 

101 150 
msal8S181.2{270_090} KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNI KKL KKRIKSNQKK 
msal85181.2{270_18RS2l} KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNI KKL KKRIKSNQKK 
msal85181.2{270_2603} KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNI KKL KKRIKSNQKK 
Consensus ********** ********** ********** ********** ********** 

, 151 200 

msal85181.2{270_090} LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT 

msal85181.2{270_18RS2l} LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT 

msal85181.2{270_2603} LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT 

Consensus ********** ********** ********** ********** ********** 

, 201 250 

msal85181.2{270_090) SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV 

msal8 5181.2{270_18RS2l} SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV 

msal85181.2{270_2603} SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV 

Consensus ********** ********** ********** ********** ********** 

t 251 300 

msal85181.2{270_090} DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA 

msal85181.2{27 0_18RS2l} DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA 

msal8 518 1.2 {270_2603} DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA 

Consensus ********** ********** ********** ********** ********** 

301 

msal 8 5 18 1.2 {270^090} RLLM 
msal85181.2.{270_18RS2l} RLLM 
msal85181.2{270_2603} RLLM 
Consensus **** 
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SEQ ID NO. 6201 
STRAIN 2603 

ATGATTTTAAAAATTTGTCGTGC^GCATATAGTTTACAATGGGGAGGTGTTTACCAATTA 
GCTTTGCTGGATTATCCTCGAATTAAGGCGTTTGAATTGGAAAGGATAGGAGCTTTCATA 
GCTTACGAGAAACAATATAAAAGAAAAACTGAGATACAATGTG 

GCAAAAATTGTT CATTTTTT AAAATAGA^TAGTTTTACTTTTCC CTATATTCC CAAATAT 
AGAGAAGCGGCAGCTACTTTTAATGAGGATGGTATTAGTTTAACTTCTGA^rTI'TTAAGC 
CATAGATGTACGATTGAAACTGCAAAACrAATTTTTAAAGAAGGTAAAATCrTATCAGCA 
GTTAAAG C CTTT AATAAGC CTG CTG AAGTACTGGTAAA^G ATAAG AGGAATG CTGCTGGA 
GACC CTAAAG ATTACTTTG ACTATGTGAT GTTGAACTGGT CAAAT AC CAATT CTGGTTAT 
CGTTTAGTAATGGAAAGATTGTTAGG CAAAGCAC CAT CTGAACAGGAGTTAACAGTAGGT 
TTTAAGC CAGGGGT CAGTTTTCATTTTACTTAT CAAGAT AT CAT CAATCATC CTGATTCT 
ATTTTTGATGGTTATCATCCrGCTAAAATTAAA 

GTTG CAT GTGTT AT C C CAAAAC AT TAT CAAGAAG ATT AT CAAAG C CTTGTG C C CAATGAC 

TTGAAACACAGGGTTTATTATTTAGATTACTGTAACG 

AAAGTTT ATGATTT TCTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6202 
STRAIN 090 

TGGATTATCCTCTAATTAAGGCGTTTGAATTGGAAAGGATAGGAGCriT^ 

ATAGCnTACGAGAAAC^^TATAAAAGAAAAATTGAGATAC^TGTGACGA 

TAAACATCTCCTCAC^AAAATTGTTC^TTTTTTAAAA 

CTTTTCC CTATATT C CCAAATATAGAGAAG CGGCAGCTACTTTTAATGAG 

GATGGTATTAGTTT AACTT CTG ATTTTTT AAG C CAT ACATGT ACGATTG A 

AACTGCAAAACTAATTTTTAAAGAAGGTAAAATC^ 

CCTTTAATAAGCCTGCTGAAGTACTGGTAAATGATAAGAGGAATGCTGCT 
GGAGACCCTAAAGATTACITTGACrATGTGATGTTGAACrGGTC^ 
CAATT CTGGTTATCGTTTAGTAATGGAAAGATTGTTAGG CAAAG CACCAT 
CTGAACAGGAGTTAACAGTAG CTTTTAAG CCAGGGGT CAGCTTT CATTTT 
AATT aTCAAG ATAT CAT CAATCATC CTGATT CTA'ITTTTGATGGTTATCA 
T CCTG CTAAAATTAAAAAT CAACiTT CTTTAG CAGAACATTT AGTTG CAT 
GTGTT ATCC CAA^CATTATCAAGAAGATTAT CAAAG CCTTGTG CCTAAT 
GACTTGAAACACAGAGTTTATTATTTAGATTACT 

TGAGTGGAAT CAAAAAGTTTATGATTTTCTTTGT CATTTGGAAAATAAA 

SEQ ID NO. 6203 
STRAIN A909 

TTGCTGGATTAT CCT CG AATTAAGG CGTTTGAATTGG AAAGGAT A 
GGAG CITTCATAGCTTACGAGAAACAATATAAAAGAAAA^TTGAGATACA 
ATGTGACGATAAACATCT CCT CACAAAAATTGTT CATTTTTT AAAATACA 
ATAGTTTTACTTTTCCCTATATTCCCAAATATAGAGAAGCG^CAGCrACT 
TTTAATG AGGATGGTATTAGTTTAACTTCTGATTTTTTAA 
TACGATTGAAACTG CAAAACTAATTTTTAAA.GAAGGTAAAAT CTT AT CAG 
C^GTTAAAGCCTTTAATAAGCCTGCTGAAGTACTGGTAAATGATAAGAGG 
AATG CTGCTGGAGAC CCTAAAG ATTACTTTGACT ATGTGATGTTGAACTG 
GTCAAATAC CAATT CTGGTTATCGTTTAGTAATGG AAAGATTGTT AGG CA 
AAGCAC CAT CTG AACAGGAGTTAACAGTAG CTTTTAAG C CAGGGGTCAG C 
TTTCATTTTAATTAT CAAGATATCATCAATCATC CTGATT CTATTTTTGA 
TGGTTAT CATCCTG CTAAAATTAAAAATCAACTTT CTTTAG CAGAACATT 
TAGTTGCATGTGTTATCCaaAAACATTATCAAGAAGATTATCAAAGCCTT 
GTGCCTAATGACrTGAAACACAGAGTTTATTATTTAGATTACTGTAAC 
AACACTTTATGAGTGGAAT CAAAAAGTTTATG ATTTT CTTTGTCATTTGG 
AAAATAAA 

SEQ ID NO. 6204 
STRAIN H36B 

TTAAGGCGTTTG AATTGGAAAGGATAGGAG CTTT CAT AG CTTACG AGAAA 
CAATATAAAAGAAAAATTGAGATACAATGTGACGATAAACAT CT C CT CAC 
AAAAATTGTTCATTTTTTAAAATACAATAGTTTTA 

C CAAATATAGAG AAG CGGCAG CTACTTTTAATGAGGATGGTATTAGTTTA 
ACTT CTG ATTTTTT AAG CCATACATGTACG ATTGAAACTG 
TTTTAAAGAAGGTAAAAT CTTATCAG CAGTTAAA.G CCTTTAATAAG CCTG 
CTGAAGT ACTGGTAAATGATAAGAGGAATG CTG CT GGAGACC CTAAAGAT 
TACTTTG ACT ATGTGATGTTGAACTGGT CAAATAC CAATT CTGGTTATCG 
TTTAGTAATGGAAAGATTGTTAGGCAAAG CACCAT CTGAACAGGAGTTAA 
CAGTAGCITTT'AAG CCAGG GGTCAG CTTT CATTTT AATTAT CAAGAT AT C 
ATCAATCATC CTGATT CTATTTTTGATGGTTATCATCCTG CTAAAA.TTAA 
AAAT CAACTTT CTTT AG CAGAACATTTAGTTG CATGTGTTATCCCAAAAC 
ATTAT CAAGAAGATT ATCAAAGCCTTGTG CCTAATGACTTGAAACAC^ 
GTTTATTATTTAGATTACTGTAACGAAACACTTTATG^ 
AGTTTATGATTTT CTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6205 
STRAIN 18RS21 

TTGCTGGATTATCCTCGAATTAAGGCGTT 

TGAATTGGAAAGGATAGGAG CTTT CATAG CTTACGAG AAACAAT AT AAAA 
GAAAAACTGAGATACAATGTGACGATAAA.CAT CT C CT CGCAAAAATTGTT 
CATTTTTTAAAATACAATAGTTTT ACTTTT C C CTATATT C CCAAATATAG 
AGAAG CGGCA.GCT ACITTTAATGAGGATGGTATT AGTTTAAC^ 
TTTTAAGCCATA(ZATGTACGATTGAAACTGCAAAACTAATTT 
GGTAAAATCTTAT CAG CAGTTAAAG CCTTTAATAAGC CTG CTGAAGT ACT 
GGTAAAAGATAAGAGGAATGCTGCTGGAGACCCTAAAGATTACITTX3A 
ATGTGATGTTGAACTGGTCAAATACCAATTCTGGTTATCGTTTAGTAATG 
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GAAAGATTGTTAGG CAAAG CAC CAT CTGAACZAGGAGTT^ 

TAAG C CAGGGGT CAGTTTT CATTTTACTT ATCAAG AT AT CAT CAAT CAT C 

CrGATTCTATTTTTGATGGTTATCATCCrGCT 

TCTTTAG (CAGAACATTT AGTTG CATGTGTTAT CC CAAAACATTAT CAAG A 
AGATTAT CAAAG CCTTGTG CCCAATGACTTGAAACACAGGGTTTAT^ 
TAGATTACTGTAACGAAACACITTTATGAGTGGAATCAAAAAGTTTATGAT 
TTTCTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6206 
STRAIN M732 

TTGCTGGATTATCCTCGAATTAAGGCGTT 

TGAATTGGAAAGGATAGGAG CTTT CATAG CTTACG AG AAACAAT ATAAAA 
GAAAAACTGAGATACAATGTGACGATAAACATCTCCTCGCAAAAATTGTT 
CATTTTTTAAAAT ACAAT AGTTTT ACITTT C C CTATATT C CCAAATATAG 
AG AAG CGGCAGCTACTTTTAATGAGGATGGTATTAGTT^ 
TTTT AAG CCATACATGT ACGATTGAAACTG (3AAAA.CT AATTTTTAAAG AA 
GGTAAAAT CITATCAGCA.GTTAAAG CCTTTAATAAGCCTG CTGAAGT ACT 
GGTAAAAGATAAGAGGAATGCTGCTGGAGACCCTAAAGATTACIT'TGACT 
ATGTGATGTTGAACTGGTCAAATAC CAATT CTGGTTAT CGTTTAGTAATG 
GAAAGATTGTTAGG CAAAG CAC CAT CTGAACAGGAGTTAACAGTAGGTTT 
TAAG C CAGGGGT CAGTTTTCATTTT ACTT ATCAAGATATCAT CAATCATC 
CTGATTCTATTTTTG ATGGTTATCAT CCTG CT AAAATTAAAAATCAG CTT 
TCTTTAG CAGAACATTT AGTTGCATGTGTTATC C CAAAACATTATCAAGA 
AGATTATCAAAGCCTTG-TGCCCAATGACITGAAACACAGGG 
TAGATTACTGTAACGAAACACTTTATGAGTGGAAT CAAAAAGTTTATGAT 
TTTCITTGnCATTTGGAAAATAAA 

SEQ XD NO. 6207 
STRAIN COH1 
TTGCTGGAT 

TATCCTCGAATTAAGGCGTTTGAATTGGAAAGGATAGGAG LTl'i"!' CAT AGC 
TTACGAGAAACAATATAAAAGAAAA^CTGAGATACAATGTGACGATAAAC 
ATCTC CTCX3CAAAAATTGTT CATTTTTTAAAATACAATAGTTTT ACTTTT 
CCCTATATT C C CAAAT ATAGAG AAG CGGCAG CTACTTTTAATGAGGATGG 
TATTAGTTTAACTT CTGATTTTTTAAG CCATACATGT ACGATTGAAACTG 
CAAAACTAATTTTTAAAGAAGGTAAAATCTTATCAGCAG 
AATAAGCCTGCTGAAGTACrGGTAAAAGATAAGAGGAATGCTGCTGGAGA 
CC CTAAAGATTACTTTGACT ATGTGATGTTGAACTGGT CAAATAC CAATT 
CTGGTTATCGTTTAGTAATGGAAAGATTGTTAGG CAAAGCACCATCTGAA 
CAGGAGlTAACAGTAGGTTTTAAG C CAGGGGTCAGTTTT CATTTTACTTA 
T CAAGATAT CATCAAT CAT C CTGATT crATTTTTGATGGTTATCATC CTG 
CTAAAATTAAAAATCAG CTTTCTTT AG CAGAACATTT AGTTG CATGTGTT 
ATCCCAAAACATTATCAAGAAG ATTATCAAAG C CTTGTG C CCAATGACTT 
GAAACAC^GCJGTTTATTATTTAGATT^ 

GGAATCAAAAAGTTTATGATTTTCTTTGGCATTTGGAAAATAAA 

SEQ ID NO. 6208 
STRAIN M781 
TTGCTGGA 

TTATCCTCGAATTAAGGCGTTTGAATTG<3AA 

CTTACGAGAAACAATATAAAAGAAAAACTGAGATACAATGTGACGATAAA 
CATCTCCrrCGCAAAAATTGTT CATTTTTTAAAAT^ 

TC CCTATATT C CCAAAT ATAGAGAAG CGG CAG CTACTTTTAATGAGGATG 
GTATTAGTTTAACTT CTGAITT'IUT'AAGC CATACATGTACGATTG AAACT 
G CAAAACTAATTTTTAA^GAAGGT AAAAT CTTAT CAG CAGTTAAAG C CTT 
TAATAAGCCTGCTGAAGTACTGGTAAAAGATAAGAGGAATGCTGCTGGAG 
ACCCTAAAGATTACTTTGACTATGTGATGTTGAACTGCT 
TCTGGTTATCGTTTAGTAATGGAAAGATTGTTAGGCAAAGCACCATCTGA 
ACAGGAGTTAACAGT AGGTTTTAAG CCAGGGGTCAGTTTT CATTTTACTT 
AT CAAGATAT CATCAATCATCCTG ATT OTATTTTTGATGGTTATCAT C CT 
G CTAAAATTAAAAAT CAG CTTT CTTTAGCAGAACATTTAGTTGCATGTGT 
TATC C CAAAACATTAT CAAG AAGATTAT CAAAG CCTTGTG CC CAATG ACT 
TGAAACACAGGGTTTATTATTTAGATTACTGTAACG^ 
TGGAATCAAAAAGTTTATGATTTTCTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6209 
STRAIN GJB110 

TTGCTGGATTATCCT CGAATTAAGG C 

GTTTGAATTGGAAAGGATAGGAGCTTTCATAGCTTACGAGAAACAATATA 
AAAGAAAAATTGAG ATACAATGTG ACGATAAACAT CTCCT CACAAAAATT 
GTTCATTTTTTAAAATACAATAGTTTTACITTT^ 
TAGAGAAGCGGCAGCTACTTTTAATGAGGATGGTATTAGTTTAAC^ 
ATTTTTT AAG CCATACATGT ACGATTGAAACTG CAAAACTAATT^ 
GAAGGTAAAATCTTATCAG CAGTTAAAG CCTTTAATAAG C CTGCTGAAGT 
ACTGGTAAATGATAAGAGGAATGCTGCTGGAGACCCTA^ 
AC?TATGTGATGTTGAACTGGTCAAATACCAATT CTGGTTAT CGTTTAGT A 
ATGG AAAGATTGTTAGG CAAAG CAC CATCTGAACAGG AGTTAACAGT AG C 
TTTTAAGCCAGGGGT CAGCTTT CATTTTAATTAT CAAGATAT CATCAATC 
ATC CTGATTCTATTTTTGATGGTTATCAT C CTGCTAAAATTAAAAAT CAA 
CTTTCTTTAG CAGAACATTTAGTTG CATGTGTTAT CCCAAAACATTAT CA 
AGAAGATTATCAAAGCCTTGTGCCTAATGACTTGAAACACAG 
ATTTAGATTACTGTAACGAAACACTTTATGAGTGGAATCAAAAAGTTTAT 
GATTTTOTTTGTCATTTGGAAAATAAA 
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Table 62: Comparative Sequences relating to SAG0690 



SEQ ID NO. 6210 
STRAIN 1169NT 

i^TTAAGGCGTTTGAATTGGAAAGGATAGGAGCTrTTCATAGCTTACGAGA 

AACAATATAAAAGAAAAACTGAGATACAATGTGACGATAAACATCTCCTC 

G CAAAAATTGTTCATTTTTT AAAATAGAAT AGTTTT ACT 

T C C CAAATATAG AGAAG CG G CAGCTACTTTTAATG AGGATGGTATTAGTT 

TAACTTCTG ATTTTTTAAG C CAT ACAT GTACG ATTGAAACTG CAAAACTA 

ATTTTTAAAGAAGGTAAAAT CTTAT CAG CAGTTAAAG C CTTTAAT AAGC C 

TGCTGAAGTACTGGTAAATG ATAAGAGGAATG CTG CT GG AGACC CTAAAG 

ATTACTTTGACTATGTGATGTTGAACTGGTCAAATACCAATTCTGGTTAT 

CGTTTAGTAATGGAAAGATTGTTAGGCAAAGCACCATCTGAACAGGAGTT 

AACAGTAGGTTTTAAGCCAGGGGTCAGCTTTCATTTTACTTATCAAGATA 

TCAT CAATCAT C CTGATTCTATTTTTGATGGTTATCATCCTG CTAAAATT 

AAAAAT CAG CTTT CTTTAG CAGAACATTT AGTTG CGTGTGTT AT C CCAAA 

ACATTAT CAAGAAG ATT AT CAAAAT CITGTGCCCAATGACTTGAAACACA 

GAGTTTATTA riTAGA TTACrGTAACGAAACACTOTATGAGTGGAATC^ 

AAAGTTT ATG ATTTT CTTTGT CATTTGGAAAATAAA 

SEQ ID NO. 6211 
STRAIN JM9130013 

ATAGGAGCTTTCATAGCTTACGAGAAACAATATAAAAGAAAAATTGAGAT 
ACAATGTGACGATAAACAT CTC CT CLAGAAAAATTGTTCATTTTTT AAAAT 
ACAATAGTTTTACTTTTCCCTATATT CCCAAATATAGAGAAG CGGCAG CT 
ACITTTAATG AGGATGGTATTAGTTTAACTT CTGATTTTTTAAG C CATAC 
ATGTACGATrGAAACTGCAAAACTAATTTTTAAA 

CAGCAGTTAAAGCCTTTAATAAGCCTGCrGAAGTACTGGTAAATGATAAG 
AGGAATG CTGCTGGAGACC CTAAAG ATTACTTTGACTATGTGATGTTGAA 
CT GGTCAAAT ACCAATT CTGGTTAT CGTTTAGTAATGGAAAGATTGTTAG 
GCAAAGCAC CATCTG aACAGGAGTTAACAGTAGCTTTTAAGC CAGGGGTC 
AGCTTTCATTTTAATTATCAAGATATCATCAATCATC 
TGATGGTTATCAT C CTG CTAAAATTAAAAATCAACrTTTCTTTAG CAG AAC 
ATTTAGTTGCATGTGTTATCCCAAAACATTATCAAGAAGATT 
CTTGTGCCTAATGACTTGAAACACAGAGTTTATTATTTAGATTACTGTAA 
CGAAACACTTTATGAGTGGAAT CAAAAAGTTTATG ATTTTCTTTGTCATT 
TGGAAAATAAA 

PRETTY of: /biotmp/msal85284 ,2 { * } May 13, 2003 07:08 



, 1 50 

msal85284 .2{271_090} 

msal8 5284.2{271_H36B} 

msal85284 . 2 { 271_JM913 0013 } 

msal85284.2{271_A909J 

msal85284.2{271_CJB110} 

msal85284.2{271_18RS2l} . 

msal85284.2{271_2603} atgattttaa aaatttgtcg tgcagcatat agtttacaat ggggaggtgt 

msal8 5284.2{271_M732} 

msal85284.2{271_M78l} 

msal85284.2{271_COHl} 

msal85284.2{271_1169NT) 

Consensus ********** ********** ********** ********** ********** 

51 100 

meal85284.2{271_090} tgg attatcctct aattaaggcg tttgaattgg 

msal85284 . 2 {271_H3GB} ttaaggcg tttgaattgg 

msal85284 . 2 { 271_JM9130013 } Z~ 

msal85284.2{271_A909} TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_CJB110} TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_18RS2l} TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_2603) ttaccaatta gctTTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2f271_M732) TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_M78l} TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_COHl} TTGCtgg attatcctcg aattaaggcg tttgaattgg 

msal85284.2{271_1169NT} aattaaggcg tttgaattgg 

Consensus ********** ******* _ , 



msal85284.2{271_090} 
msal 8 52 84 . 2 { 2 7 1__H3 6B } 
msal85284 .2{271__JM9130013' 
msal8S284.2{271_A909] 
msal852 84 . 2 { 271_CJB110 1 
msal85284.2{271_18RS2l' 
msal85284 . 2 (271_2603 ' 
msal85284.2{271_M732} 
msal8 52 84 . 2 ( 271_M78 1 } 
t msal85284.2{271j20Hl) 
msal85284 . 2 { 271JL169NT} 
Consensus 



101 

aaaggATAGG 
aaaggATAGG 

ATAGG 

aaaggATAGG 
aaaggATAGG 
aaaggATAGG 
aaaggATAGG 
aaaggATAGG 
aaaggATAGG 
aaaggATAGG 
aaaggATAGG 



AG CTTT CAT A 
AGCTTTCATA 
AG CTTT CAT A 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 
AGCTTTCATA 



GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 
GCTTACGAGA 



AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 
AACAATATAA 



150 

AAGAAAAAtT 
AAGAAAAAtT 
AAGAAAAAtT 
AAGAAAAAtT 
AAGAAAAAtT 
AAGAAAAAcT 
AAGAAAAAcT 
AAGAAAAAcT 
AAGAAAAAcT 
AAGAAAAAcT 
AAGAAAAAcT 



-***** ********** ********** ********** ********_* 



151 200 
msal85284.2{271_090} GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT 
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Table 62: Comparative Sequences relating to SAG0690 



msal85284 
msa!85284.2{271 

rasal85284 
msal85284.2{ 
msal85284.2{ 

msal85284 . 

msal85284 . 

msa!85284. 

msal85284 . 
msal85284.2{ 



2{271_H36B} 
_JM9130013} 
2{271_A909} 
271_CJB110} 
271_18RS21> 
2{271_2603} 
2{271_M732} 
2{271_M781} 
2{271_COHl} 
271_1169NT} 
Consensus 



msa!85284.2{271_090} 
msal85284.2{271_H36B} 
msal85284 . 2 {271_JM9130013 ) 
msal85284.2{271_A909} 
msal85284 . 2 {27 1_CJB110 } 
msal85284.2{271_18RS2l} 
msal85284.2{271_2603) 
msal85284.2{271_M732} 
msal85284.2{271_M78l} 
ms al 8 52 84 . 2 { 27 l_COHl } 
msal85284.2{271_JL169NT} 
Consensus 



msal85284.2{271_090} 
tnsal85284.2{271_H36B} 
msal85284.2{271_JM9130013} 
msal85284 .2{271_A909} 
msal85284 . 2 { 271_CJB110 \ 
msal85284.2{271_18RS2l} 
msal85284.2f271_2603} 
msal85284.2i271_M732} 
msal85284.2{271_M78l) 
msal85284.2{271_COHl} 
msal85284 . 2 { 271_1169NT> 
Consensus 



msal85284 . 2 { 271_090 } 
msal85284.2{271_H36B} 
rasal85284-2{271_JM9130013} 
msa!85284 .2{271_A909] 
maal85284 . 2 { 271_CJB110 ) 
msal85284.2{271_18RS2l] 
msal85284.2{271_2603 j 
msal85284 . 2[ 271_M732 ] 
msal85284.2{271_M78l} 
msal85284.2{271_COHlJ 
msal85284.2{271_1169NT} 
Consensus 



msal85284 .2{271_090} 
msal85284.2{271_H36B} 
msal85284.2{271_JM9130013} 
msal85284 . 2 {271_A909 I 
msal85284 . 2 ( 271_CJB110 } 
msal85284 . 2 {271_18RS2l} 
msa!85284.2{271_2603; 
msal85284.2{271_M732; 
msal85284 . 2 {271J4781 j 
msal85284.2{271_COHl 
msal85284.2{271_1169NT] 
Consensus 



msal85284 . 2 { 271_090 
msal85284 .2{271_H36B, 
msal 8 52 8 4 . 2 { 2 7 1_JM9 13 0 0 13 
rasal85284 . 2 { 271_A90 9 [ 
msal85284.2(271_CJB110 
msal85284.2{271_18RS21^ 
msal85284 ,2(271_2603} 
msal85284.2i271_M732> 
msal85284.2(271_M781> 
msal85284.2{271_COHl> 
msal85284 . 2 { 271_1169NT} 
Consensus 



GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
GAGATACAAT 
********** 

201 

AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
AAAATACAAT 
********** 

251 

CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
CAGCTACTTT 
********** ********** 



GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
GTGACGATAA 
********** 



AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
AGTTTTACTT 
********** 



ACATCTCCTC aCAAAAATTG TTCATTTTTT 
ACATCTCCTC aCAAAAATTG TTCATTTTTT 
ACATCTCCTC aCAAAAATTG TTCATTTTTT 
ACATCTCCTC aCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
ACATCTCCTC gCAAAAATTG TTCATTTTTT 
********** 

250 

TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
TCCCAAATAT AGAGAAGCGG 
********** ********** 



********** w********* 



TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
TTCCCTATAT 
********** 



TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 
TAATGAGGAT 



GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
GGTATTAGTT 
********** 



TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
TAACTTCTGA 
********** 



301 

CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
CATACATGTA 
********** 

351 

CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 
CTTATCAGCA 



CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
CGATTGAAAC 
********** 



TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
TGCAAAACTA 
********** 



GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 
GTTAAAGCCT 



********** ********** 



TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
TTAATAAGCC 
********** 



ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
ATTTTTAAAG 
********** 



TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
TGCTGAAGTA 
********** 



401 

ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
ATAAGAGGAA 
********** 



TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
TGCTGCTGGA 
********** 



GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
GACCCTAAAG 
********** 



ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
ATTACTTTGA 
********** 



300 

TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
TTTTTTAAGC 
********** 

350 

AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
AAGGTAAAAT 
********** 

400 

CTGGTAAAtG 
CTGGTAAAtG 
CTGGTAAAtG 
CTGGTAAAtG 
CTGGTAAAtG 
CTGGTAAAaG 
CTGGTAAAaG 
CTGGTAAAaG 
CTGGTAAAaG 
CTGGTAAAaG 
CTGGTAAAtG 
********_* 

450 

CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
CTATGTGATG 
********** 



451 



500 



940 



WO 2004/018646 



PCT/US2003/026827 



Table 62: Comparative Sequences relating to SAG0690 



msal85284.2{271__090) TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2{271_H3 6B} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal 8 52 84. 2 { 27 1_JM9 13 00 13} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2{271_A909} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2f 271_CJB110} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284.2{271_18RS2l} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2f 271_2 603 } TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2{271_M732 } TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2{271_M78l} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 . 2{271_COHl} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

msal85284 .2{271_1169NT} TTGAACTGGT CAAATACCAA TTCTGGTTAT CGTTTAGTAA TGGAAAGATT 

Consensus ********** ********** ********** ********** ********** 



msal85284 . 2 {271_090 ) 
msal85284.2{271_H36B} 
msal85284.2{271_JM9130013} 
msal85284.2{271_A909} 
msal85284.2{271_CJB110} 
msal85284.2{271_18RS2lJ 
msal85284.2{271_2603) 
msa!85284.2{271_M732} 
msal8 52 84 . 2 { 2 7 1_M7 8 1 } 
msal85284.2{271_COHl) 
msal85284.2{271_1169NT} 
Consensus 



501 

GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
********** 



GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
********** 



AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
********** 



AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
********_* 



550 

TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
********** 



551 600 

msal85284.2{271_090} GGGTCAGcTT TCATTTTAaT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271__H36B} GGGTCAGcTT TCATTTTAaT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284 .2{271_JM9130013} GGGTCAGcTT TCATTTTAaT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271_A909} GGGTCAGcTT TCATTTTAaT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271_CJB110} GGGTCAGcTT TCATTTTAaT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284 .2{271_18RS2l} GGGTCAG t TT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284 . 2{271_2603 } GGGTCAG tTT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal8 5284.2{271_M732> GGGTCAG tTT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271_M781) GGGTCAGtTT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271_COHl} GGGTCAGtTT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

msal85284.2{271_1169NT} GGGTCAGcTT TCATTTTAcT TATCAAGATA TCATCAATCA TCCTGATTCT 

Consensus *******-** ********_* ********** ********** ********** 



msal85284 . 2 { 271_090 ) 
msal85284.2{271_H36B} 
tnsal85284.2{271_JM9130013} 
msal85284.2{271_A909} 
msal85284.2{271_CJB110) 
msal85284.2{271_18RS21} 
msal85284 . 2 { 271_2603 } 
msal85284 . 2 { 271_M732 } 
msal85284.2f271_M78l) 
msal 85284.2(27 l_COHl } 
msal85284.2{271_1169NT} 
Consensus 



601 

ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
ATTTTTGATG 
********** 



GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
GTTATCATCC 
********** 



TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 
TGCTAAAATT 



AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 



********** ********_* 



650 

TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
TTTCTTTAGC 
********** 



msal85284 
msa!85284 . 
msal85284.2{271 
msal85284 . 
msal85284 
msal85284 
msal85284 
msal85284 
msal85284 
msal85284 
msal85284.2{ 



L.2{ 



2{271_090] 
2{271_H36B) 
_JM9130013j 
2{271_A909} 
271_CJB110j 
271_18RS2i; 
2{271_2603 
2(271_M732! 
2{271_M781^ 
2{271_COHl' 
271__1169NT] 
Consensus 



651 

AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 
AGAACATTTA 



GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCaTGTG 
GTTGCgTGTG 



TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 



********** *****_**** ********** 



ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
********** 



700 

GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
GAAGATTATC 
********** 



701 

msal85284 .2{271_090} AAAgcCTTGT GCCtAATGAC 

msal85284 .2{271_H36B} AAAgcCTTGT GCCtAATGAC 

msal 8 52 84. 2 { 27 1_JM9 13 0 013} AAAgcCTTGT GCCtAATGAC 

msal85284.2{271_A909> AAAgcCTTGT GCCtAATGAC 

msal85284.2{271_CJB110} AAAgcCTTGT GCCtAATGAC 

msal85284.2{271_18RS2l} AAAgcCTTGT GCCcAATGAC 

msal85284.2{271_2603} AAAgcCTTGT GCCcAATGAC 

msal85284.2{271_M732j AAAgcCTTGT GCCcAATGAC 

msal85284.2?271_M781> AAAgcCTTGT GCCcAATGAC 

msal85284 .2{271_COHl} AAAgcCTTGT GCCcAATGAC 

msal85284 . 2 {271_1169NT} AAAatCTTGT GCCcAATGAC 

Consensus ★**-.-***** ***-****** 



TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 
TTGAAACACA 



GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GaGTTTATTA 



750 

TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 
TTTAGATTAC 



********** *_******** ********** 
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maal85284 

msal85284 . 
msal85284 .2(271 

msal85284 . 
msal85284 
msal85284 

msal85284 . 

msal85284 . 

msal85284 . 

msal85284 . 
msal85284 .2{ 



.2{271_090} 
2{271_H36B} 
_JM9130013} 
o -*.2{271__A909) 
.2{271_CJB110) 
.2{271_18RS2l' 
2{271_2603* 
2f 271_M732; 
2{271_M78l] 
2{271_COHl : 
271_1169NT] 
Consensus 



751 

TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 



GACTTTATGA 

CACTTTATGA 
GACTTTATGA 

CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 



GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 



********** ********** ********** 



AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
********** 



800 

ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
AT 
AT 

ATTTTCTTTG 
********** 



msal85284.2{271_090j 
msal85284.2{271_H36B} 
msal85284.2{271_JM9130013> 
msal85284 . 2{271_A909> 
msal85284 . 2 f 271_CJB110 J 
msal85284 . 2 { 271JL8RS21 } 
msal8S284.2{271_2603; 
msal85284 .2{271_M732; 
msal85284.2i271_M78l] 
msal85284 .2 {271__COHl ! 
msal85284.2{271_1169NT] 
Consensus 



801 

tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
nCATTTGGAA 
tCATTTGGAA 
gCATTTGGAA 
tCATTTGGAA 



816 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 



_********* ****** 



SEQ ID NO. 6212 

STRAIN 2603 frame: 1 

MILK I CRAAYSLQWGGVYQLALLD YPR I KAFELER I GAF I AYEKQYKRKTE I QCDDKHLL 
AKI VHFLKYNSFTFPYI PKYREAAATFNEDGI SLTSDFLSHTCTI ETAKL I FKEGKI LSA 
VKAFNKPAEVLVKDKRNAAGDPKDYFDYVML^SimiS 

FKPGVSFHFTYQDI INHPDS I FDGYHPAKI KNQL.SIAEHLVACVI PKHYQEDYQSLVPND 
LKHRVYYLDYCNETLYEWNQKVYDFLCHLENK 

SEQ ZD NO. 6213 

STRAIN A909 frame: 1 

LLDYPRIKAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYR 
EAAATFNEDGI SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVNDKRNAAGD 
PKDYFDYVMIjNWSNTNSGYRLWERLLGKAPSEQELTVAFKPGVSFHFNYQD I INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK 
VYDFLCHLENK 

SEQ ID NO. 6214 
STRAIN H36B frame: 3 

KAFELERIGAFIAYEKQYKRKI EI QCDDKHLLTKI VHFLKYNSFTFPYI PKYREAAATFN 
EDGI SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVNDKRNAAGDPKDYFDY 
VMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQD I INHPDS I FDGYHPA 
KI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLCH 
LENK 

SEQ ID NO. 6215 
STRAIN 18RS21 frame: 1 

LLDYPRI KAFELERI GAF I AYEKQYKRKTE I QCDDKHLLAKI VHFLKYNS FTFPY I PKYR 
EAAATFNEDG I SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVKDKRNAAGD 
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVS FHFTYQD I INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK 
VYDFLCHLENK 

SEQ ID NO. 6216 
STRAIN M732 frame: 1 

IJ.DYPRI KAFELER I GAF I AYEKQYKRKTE I QCDDKHLLAKI VHFLKYNS FTFPY I PKYR 
EAAATFNEDGI SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVKDKRNAAGD 
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVSFHFTYQDI INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK 
VYDFLXHLENK 

SEQ ID NO. 6217 

STRAIN COH1 frame: 1 

LLDYPRI KAFELERI GAF I AYEKQYKRKTE I QCDDKHLLAKI VHFLKYNS FTFPY I PKYR 
EAAATFNEDGI SLTSDFLSHTCTI ETAKLI FKEGKI LSAVKAFNKPAEVLVKDKRNAAGD 
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVS FHFTYQD I INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK 
VYDFLWHLENK 

SEQ ID NO. 6218 
STRAIN M781 frame: 1 
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LLDYPRIKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYR 
EAAATFNEDG I SLTSDFLSHTCTI ETAKLI FKEGKI LSAVKAFNKPAEVLVKDKRNAAGD 
PKDYFDYVMLNWSNTNSGYRLVN1ERLLGKAPSEQELTVGFKPGVSFHFTYQDI INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQS LVPNDLKHRVYYLDYCNETLYE WNQK 
VYDFLCHLENK 

SEQ ID NO. 6219 

STRAIN CJB1 10 frame: 1 

liLDYPRIKAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYR 
EAAATFNEDG I SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVNDKRNAAGD 
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQD I INHPDS I 
FDGYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK 
VYDFLCHLENK 

SEQ ID NO. 6220 

STRAIN 1169NTframe:2 

I KAFELERI GAFI AYEKQYKRKTE I QCDDKHLLAKI VHFLKYNSFTFPY I PKYREAAATF 
NEDG I SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVNDKRNAAGDPKDYFD 
YVMLNWSNITSrSGYRLVMERLIiGKAPSEQELTVGFKPGVSFHFTYQDI INHPDS I FDGYHP 
AKI KNQLSLAEHLVACVI PKHYQEDYQNLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLC 
HLENK 

SEQ ID NO. 6221 
STRAIN JM9130013 frame: 1 

I GAF I AYEKQYKRKI E I QCDDKHLLTKI VHFLKYNSFTF PY I PKYREAAATFNEDGI SLT 
SDFLSHTCTI ETAKLI FKEGKILSAVKAFNKPAEVLVl^KRNAAGDPKDYFDYVMLNWSN 
TNSGYRLVMERLLGKAPSEQELTVAFKPGVS FHFNYQD I INHPDS I FDGYHPAKI KNQLS 
LAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLCHLENK 

SEQ ID NO. 6222 
STRAIN 090 frame: 3 

DYPL I KAFELER I GAF I AYEKQYKRKI E I QCDDKHLLTKI VHFLKYNS FT FP Y I PKYREA 
AATFNEDGI SLTSDFLSHTCTI ETAKL I FKEGKI LSAVKAFNKPAEVLVNDKRNAAGDPK 
DYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQDI INHPDS I FD 
GYHPAKI KNQLSLAEHLVACVI PKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVY 
DFLCHLENK 



PRETTY of : /biotmp/msal85358 -2{*} May 13, 2003 07:11 

1 50 

msal85358 .2{271_090} dyplika felerlGAFI AYEKQYKRKI 

rasal853S8.2{271 JM9130013} r 1 GAFI AYEKQYKRKI 

msal85358.2{2711H36B} ka felerlGAFI AYEKQYKRKI 

msal85358.2{271_A909} LLdyprika felerlGAFI AYEKQYKRKI 

msal85358.2{271_CJB110} LLdyprika felerlGAFI AYEKQYKRKI 

msal85358.2{271_1169NT) ika felerlGAFI AYEKQYKRKt 

msal85358.2{271_18RS2l} LLdyprika felerlGAFI AYEKQYKRKt 

msal85358 .2{271 2603} milkicraay slqwggvyql aLLdyprika felerlGAFI AYEKQYKRKt 

rasal85358.2{271_M732} LLdyprika felerlGAFI AYEKQYKRKt 

msal85358.2{271_M78l} LLdyprika felerlGAFI AYEKQYKRKt 

msa!85358 . 2 { 271__COHl} ' LLdyprika felerlGAFI AYEKQYKRKt 

Consensus ********** ********** *** ***** *********_ 



msal85358 
msal85358.2{271 
msal85358 
msal85358. 
rasal85358.2{ 
msal85358 
rasal85358 
msal85358 
nisal85358 
msal85358 
msa!85358 . 



>: 2 

J.2{ 



.2{271_090} 
_JM9130013> 
2{271_H36B} 
2{271_A909} 
271_CJB110} 
271_1169NT} 
271_18RS2lJ 
2{271_2603) 
2{271_M732} 
2{271_M781> 
2{271_COHl} 
Consensus 



51 

EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
EIQCDDKHLL 
********** 



tKIVHFLKYN 
t KI VHFLKYN 
tKIVHFLKYN 
tKIVHFLKYN 
tKIVHFLKYN 
aKI VHFLKYN 
aKI VHFLKYN 
aKI VHFLKYN 
aKI VHFLKYN 
aKI VHFLKYN 
aKI VHFLKYN 
_********* 



SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
********** 



REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
********** 



100 

GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
GISLTSDFLS 
********** 



msal85358.2{271_090 
msal85358 .2{271_JM9130013 
msa!85358.2(271_H36B 
msal85358.2{271_A909 
msal85358.2{271_CJB110 
msal85358 .2{271_1169NT 
msa!85358 . 2 {271_18RS21 
msal85358 .2{271_2603 
msal85358.2(271_M732 
msal85358 .2{271_M781 
msal8 53 58 . 2 { 27 l_COHl 



101 

HTCTIETAKL' 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 
HTCTIETAKL 



I FKEGKI LSA 
I FKEGKI LSA 
I FKEGKI LSA 
I FKEGKI LSA 
IFKEGKILSA 
I FKEGKI LSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 



VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 



LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 



150 

DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
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Consensus 



********** ********** ********** **-******* ********** 



msal85358 
msal85358.2{271 

msal85358 . 

msal85358 . 
msal85358 .2{ 
msal85358 .2 ' 
msal85358 .2 

msal85358 . 

msal85358 . 

msal85358 . 

msal85358 . 



2{271_090 
_JM9130013 
2{271_H36B 
2{271_A909} 
271_CJB110} 
271_1169NT} 
271_18RS21~ 
2{271_2603 
2{271_M732 
2l271_M781 
2{271_COHl 
Consensus 



msal85358 
msal85358.2{271 
msal85358 
msal85358 
msal85358 .2{ 
msal85358 .2{ 
msal85358 .2{ 
msal85358 
msal85358 . 
msal85358 . 
msal8S358 . 



2{271_090} 
_JM9130013} 
2{271_H36B} 
2{271_A909} 
271_CJB110} 
271_1169NT} 
271_18RS2lJ 
2{271_2603} 
~271_M732} 
271_M781} 
271_COHl} 
Consensus 



151 

LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 



RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 



APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 



FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 



200 

YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 



********** ********** *********_ *********- ********** 



201 

I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
I FDGYHPAKI 
********** 



KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 



VACVIPKHYQ 
VACVI PKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 



EDYQsLVPND 
EDYQSLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQnLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 



********** ********** ****_***** 



250 

LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
********** 



msal85358 
msal85358 .2{271 
msal85358 
msa!85358 . 
msal85358 .2 
' msal85358.2 
msal85358.2{ 
msal85358 . 
msal85358 . 
msal85358. 
msal85358 . 



.2{271_090} 
_JM9130013} 
2{271_H36B} 
2{271_A909} 
271_CJB110> 
271_1169NT) 
271_18RS2l] 
2(271_2603 < 
2{271_M732j 
2{271_M78i; 
2{271_COHl] 
Consensus 



, 251 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 



272 

KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLCHLE NK 
KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLCHLE NK 
KVYDFLcHLE NK 
KVYDFLxHLE NK 
KVYDFLCHLE NK 
KVYDFLWHLE NK 



********** ******-*** ** 
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SEQ ID NO. 6301 
STRAIN 2603 

ATGAAAAGTCGAAAAAAAGATAAATTGGTATTGAGGTTAAC7VACAACACTATTGGTTTTT 
GGTTTGGGTGGGGTTTGGTTTTATAATTATAAAAATGATAATGTCGAACCGACAGTCACT 
AGTG CAT CGG AT CAAACGACGACTTTTATT CAAAC GATTT CT CCAACAG CTATTG AAATT 
TCTAAGACCT ATGATTTGT ATG CGT G^GTCTTATTAG CACAAGCT ATTT TGGAAT CATC C 
AGTGGACAAT CAGATTTGT CTAAGG CT C CT AATTATAAC CT CTTTGG CAT CAAAGGAGAA 
T ATAAAGGT AAAT CTGT C CAAATG C CT ACTTT AG AAGATGAT GGG AAAGG CAAT ATGACT 
CAAAT C CAAG CT CCTTTTCG CG CCT AT CCAAATTATTCTG CTTCACT AT ATGAT TATG CT 
GAGTTAGTATCT AGT CAAAAGTATG CATCTGTTTGGAAAT CAAAT AC CT CTT CTTAT AAG 
GATG CTACTG CAG CT CT AACAGGT C TTTATG CGACAG ATACTG CTTATG CTAGT AAATT A 
AACCAAATTATTGAAACCTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6302 
STRAIN 090 

GGGGTTTGGTTTTATAATTATAA - 

AAATGATAATGTCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGA 
CTTTTATTCAAACGATTT CT CCAA.CAG CT ATTGAAATTTCTAAGAC CTAT 
GATTTGTATGCGT CAGT CTTATTAG CACAAGCTATTTTGGAATCAT C CAG 
TGG ACAATCAGATTTGT CTAAGGCT CCTAATT ATAAC CT CTTTGG CATCA 
AAGG AGAATATAAAGGT AAAT CTGT C CAAATGCCTACTTTAGAAGATGAT 
GGGAAAGGCAAT AT GACTCAAATC CAAG CT CCTTTTCGCG CCTAT CCAAA 
TTATTCTGCnTCACTATATGATTATGCTGAGTTAGTA 
ATG CATCTGTTTGG AAATCAAATACCTCTT CTTATAAGGATG CTACTG CA 
G CTCTAACAGGT CTTTATG CGACAGATACTG CTT ATG CTAGTAAATT AAA 
CCAAATTATTGAAAC CTACAGT CTAGATG CTTATGATAAA 

SEQ ID NO. 6303 
STRAIN A909 

GGGGTTTGGTTTTATAATTATAA 

AAATGAT AATGT CGAAC CGACAGT CACTAGTG CATCGGAT CAAACGACGA 
CTTTTATTCAAACGATTTCTCCAACAGCTA^ 

GATTTGTAT G CGT CAGT CTTAT TAG CACAAGCTATTTTGGAATCAT C CAG 
TGGACAATCAGATTTGTCTAAGGCTCCTAATTATAACCT CTTTGG 
AAGGAGAATATAAAG GT AAAT CTGT C (CAAATG CCT ACTTTAGAAGATGAT 
GGGAAAGGCAATATG ACT CAAATCCAAG CT C CTTTT CGCG C CTAT CCAAA 
TTATTCTGCTTCACTATATGATT 

ATGCATCTGCTTGGAAATCA^TACTTCTTCTTATAAGGATGCTACTG^ 
G CT CTAACAGGT CTTTATG CGACAGATACTG CTT ATG CTAGTAAATTAAA 
CCAAATTATTGAAAC CTACAGT CTAGATG CTTATG ATAAA 

SEQ ID NO. 6304 

STRAIN H36B 

GGGGTTTGGTTTTATAATTATAAAAATGATA 

ATGTCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGACTTTTATT 
GAAACGATTTCTCCAACAGCTATTGAAATTTCT 

TGCGT CAGTCTTATT AG CACAAGCTATTTTGGAAT CATC CAGTGGACAAT 
CAGATTTGT CTAAGGCT CCTAATTATAACCT CTTTGG CAT CAAAGGAGAA 
TATAAAGGTAAATCTGTCCAAATGCCTACTTTAGAAGATGATGGGAAAGG 
CAATATGACT CAAAT CCAAG CT CCTTTT CG CG C CTAT CCAAATTATTCTG 
CTTCACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGTaTGCATCT 
GCTTGGAAAT CAAATACTT CTT CTTATAAGGATG CTACTG CAGCTCTAAC 
AGGTCTTTATGCGACAGATACTGCTTATGCTAGTAAATTAAACCAAAT^ 
TTGAAACCTACAGT CTAGATG CTT ATG ATAAA 

SEQ ID NO. 6305 

STRAIN 18RS21 

GGGGTTTGGTTTTATAATTATAAAAAT GAT AATG 

TCGAACCGACAGTCACTAGTGCATCGaA.TCAAACGACGAU'iu-iTATTCAA 
ACGATTTCT C CAACAGCTATTG AAATTTCTAAGAC CTATG ATTTGTATG C 
GTCAGTCTT ATT AG CACAAG CTATTTTGG AATCAT C CAGTGGACAAT CAG 
ATTTGT CTAAC4GCT CCTAATTATAACCTCTTTC^ CATCAAAGGAGAA 
AAAGGTAAATCTGT CCAAATGC CT ACTTT AGAAGATGATGGG AAAGG CAA 
TATG ACT CAAAT CCAAG CT C CTTTT CG CG C CT ATC CAAATTATTCTG CTT 
CACTATATGATTATG CTGAGTTAGT AT CTAGT CAAAAGTATG CAT CTGTT 
TGGAAATCAAAT ACCT CTT CTTATAAGGATG CTACTG CAG CT CT AACAGG 
TCTTTATGCGACAGATACTGCTTATGCTAGTAAATTAAACCAAATTAT^ 
AAAC CTACA.GT CTAGATGCTTATGAT AAA 

SEQ ID NO. 6306 
STRAIN M732 

GGGGTTTGGTTTTATAATTATAA 

AAATGAT AATGTCGAAC CGACAGT CACTAGTG CATCGGAT CAAACGACGA 
CTTTTATT CAAACGATTT CT C CAACAG CT ATTGAAATTT CTAAGACCTAT 
GATTTGT ATG CGTCAGT CTTATTAG CACAAG CTATTTTGG AATCATC CAG 
TGGACAAT CAGATTTGT CT AAG G CT C CTAATT AT AACCTCTT TGG CATCA 
AAGGAGAAT ATAAAGGTAAATCTGT CCAA^TG CCT ACTTT AG AAGATGAT 
GGGAAAGGCAATATGACT CAAATC CAAG CT CCTTTTCGCG C CTATCCAAA 
TTATTCTGCTTCACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGT 
. ATGCATCTGTTTGGAAATCAAATACTT CTT CTTATAAGGATG CTACTGCA 
G CT CTAACAGGT CTTTATG CGACAGATACTG CTT ATG CTAGTAAATTAAA 
CCAAATTATTGAAACCTACAGTCTAGATG CTTATGATAAA 
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SEQ ID NO. 6307 
STRAIN COH1 

GGGGTTTGGTTTTATAATTATAA 

AAATGATAATGTCX3AACCGACAGTCACTAGTGCATCGGATCAAACGACGA 
CTTTT ATT CAAACG ATTTCT C CAACAG CT ATTGAAATTT CTAAG AC CTAT 
GATTTGTAT G CGTCAGT CTT ATTAG CACAAG CTATTTTGGAATCATC CAG 
TGGACAATCAGATTTGT CTAAGGCT CCT AATTATAAC CT CTTTGG CATCA 
AAGGAGAATATAAAGGTAAAT CTGTCCAAATG CCT ACTTTAG AAG ATGAT 
GG GAAAGG CAAT AT GACT CAAAT C CAAGCT CCTTTT CG CG C CTAT C CAAA 
TTATTCTGCTTCACrATATGATTATGCT^ 

ATGCATCTGTTTGGAAATCAAATACrTCTTCrTATAAGGATGCTACT 
GCTCTAACAGGTCTTTATGCGACAGATACTGCTTATGCTAGTAAATTAAA 
CCAAATTATTGAAAC CT ACAGT CT AGATG CTT AT GAT AAA 

SEQ ID NO. 6308 
STRAIN M781 

GGGGTTTGGTTTTATAATTATAAAAATGA 

TAATGTCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGACTTTTA 
TT CAAACGATTTCr C CAACAG CTATTGAAATTTCTAAGAC CT ATGATTTG 
TATG CGT CAGTCTTATT AG CACAAG CTATTTTGGAATCAT CCAGTGGACA 
ATCAGATTTGT CTAAGG CT CCTAATT ATAACCTCTTTGGCAT CAAAG GAG 
AATAT AAAGGTAAAT CTGT C CAAATGCCTACTTTAGAAGATG AT GGG AAA 
GGGAATATGACTCAAATCCAAG CT CCTTTT CG CG C CTAT C CAAATTATT C 
TG CTT CACTATATGATT ATGCTGAGTT AGTAT CTAGTCAAAAGTATG CAT 
CTGTTTGGAAATCAAATACTT CiT CTTATAAGGATGCTACTG CAG CT CT A 
ACAGGT CTTTATG CGACAGATACTGCTTATG CTAGTAAATT AAAC CAAAT 
TATTGAAAC CTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6309 
STRAIN CJB110 

GGGGTTTGGTTTTATAATTATAAAAATGATAATGT 

CGAAC CGACAGT CACTAGTG CAT CGGATCAAACG ACG AC TTTTAT T CAAA 
CGATTTCT C CAACAG CT ATT GAAATTT CTAAG AC CTATGATTTGT ATG CG 
TCAGT CTTATTAG CACAAG CTATTTTGGAAT CAT CCAGTGGACAATCAG A 
TTTGTCTAAC^CTCCTAATTATAACCTCTTC 

AAGGTAAAT CTGT CCAAATG C CTACTTTAG AAGATGATGGGAAAGGCAAT 
ATGACTCAAATCCAAGCTCCTTTTCGCGCCTATCCAAATTA 
ACTATATGATTATGCK3AGTTAGTATCTAGTCAAAAGTATGCATCTGTTT 
GGAAAT CAAATACCT CTTCTTATAAGGATGCT ACT GCAG CTCTAACAGGT 
CTTTATG CGACAGATACTG CTTATGCTAGTAAATTAAAC CAAATTATTG A 
AACCTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6310 
STRAIN 1169NT 

GGGGTTTGGTTTTATAATTATAAAAATGATAATGT 

CGAACAGACAGT CACTAGTG CAT CGGATGAAACGACGACTTTTATTCAAA 
CGATTTCCCCAACAGCrATTGAAATTTCTAAGACCTATGATTTGTATGCG 
TCAGT CTTATTAGCACAAGCTATTTTGGAATCAT C CAGTGGACAATCAGA 
TTTGTCTAAGGCTC CTAATTATAACCTCTTTGGCATCAAAGGAGAATATA 
AAGGTAAAT CTGTC CAAATG CCTACTTTAGAAGATGATGGGAAAGGCAAT 
ATGACTCAAATCCAAGCTCCTTTTCGCGCCTATCCAAATTAT 
ACTATAT(3ATTATGCTGAGTTAGTATCT^^ 

GGAAATCAAATACTT CTTCTTATAAGG ATG CTACTGCAG CT CTAACAGGT 
CTTTATG CGACAGATACTG CTTATG CTAGT AAATTAAAC CAAATTATTG A 
AACCTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6311 

• STRAIN JM9 1300 13 
TTTGGTTTTATAATTATAAAAATGATAATGT CGAAC CGACAGT CACTAGT 
G CAT CGGAT CAAACG ACGACTTTT ATT CAAACGATTT CC C CAACAG CTAT 
TGAAATTTCT AAGAC CT ATGATTTGTATGCGT CAGT CTTATTAG CACAAG 
CTATTTTGGAAT CAT CCAGTGGACAAT CAG ATTTGTCTAAGGCT C CTAAT 
TATAACCTCTTTGG CAT CAAAGGAGAATATAAAGGT AAAT CTGT T CAAAT 
GC CTACTTTAGAAG ATG ATGGGAAAGGTAATATGACC CAAATCCAAG CT C 
CTTTT CG CG C CTAT C CAAATTATT CTG CTT CACT ATATG ATT ATG CTGAG 

- TT AGTAT CTAGT CAAAAGTATG CATCrGTTTGGAAATCAAATACCTCTT C 
TTATAAGGATGCTACTG CAG CT CTAACAGGTCTTT ATGCG ACAGATACTG 
CTTATGCTAGTAAATTAAAC CAAATTATTG AAAACTACAGTCTAGATG CT 
TATGATAAA 

PRETTY of : /biotmp/msa243324 . 2 { * } February 11, 2003 05:11 



msa243324.2{275_A909} 
maa243324 .2{275_H36B} 
msa243324 . 2 {275_090 } 

msa243324.2{275_18RS2ll 
msa243324 . 2 {275_2603 } 

msa243324.2{275_CJB110} 
msa243324.2(275_COHl} 
msa243324 . 2 { 275_M732 } 



atgaaaagtc gaaaaaaaga taaattggta ttgaggttaa caacaacact 
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Table 63: Comparative Sequences relating to SAG1912 



msa243324.2{275_M78l} 

msa243324 . 2 {275_1169NT} 

msa243324 .2(275 JM9130013} 

~~ Consensus ********** ********** ********** ********** ********** 

51 100 

msa243324.2{275 A909} 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275~H36B} . 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 090} 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 18RS21} 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 2603} attggttttt ggtttgggtg gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 CJBllO} g gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 COHl} 3 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2 275~M732} g gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2 275~M78l} 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 1169NT} 9 gggTTTGGTT TTATAATTAT AAAAATGATA 

msa243324.2{275 JM9130013} -TTTGGTT TTATAATTAT AAAAATGATA 

x - consensus ********** *********- -__*★***** ********** ********** 



101 

msa243324.2{275_A909} ATGTCGAACc GACAGTCACT 

msa243324 .2{275_H36B) ATGTCGAACc GACAGTCACT 

msa243324 .2{275_090} ATGTCGAACc GACAGTCACT 

msa243324.2{275_18RS2lJ ATGTCGAACc GACAGTCACT 

msa243324.2{275_2603) ATGTCGAACc GACAGTCACT 

Ttisa243324.2{275_C»IB110} ATGTCGAACc GACAGTCACT 

msa243324.2{27 5_COHl} ATGTCGAACc GACAGTCACT 

msa243324.2{275_M732} ATGTCGAACc GACAGTCACT 

msa243324 .2{275_M781} ATGTCGAACc GACAGTCACT 

msa243324.2{275_1169NT} ATGTCGAACa GACAGTCACT 

msa243324.2{275_JM9130013} ATGTCGAACc GACAGTCACT 

Consensus *********- ********** 



AGTG CATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
********** 



ATCAAACGAC. 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
ATCAAACGAC 
********** 



150 

GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
********** 



151 

msa243324.2{275_A909} CAAACGATTT 

msa243324.2{275_H36B} CAAACGATTT 

msa243324.2{275_090} CAAACGATTT 

msa243324.2{275_18RS2l} CAAACGATTT 

msa243324.2{275_2603} CAAACGATTT 

msa243324.2{275_CJB110} CAAACGATTT 

msa243324.2{275_COHl} CAAACGATTT 

msa243324.2{275_M732} CAAACGATTT 

msa243324.2{275_M78l} CAAACGA TTT 

msa243324.2{275_1169NT} CAAACGATTT 

msa243324 . 2 {275_JM9130013} CAAACGATTT 

Consensus ********** 



CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CcCCAACAGC 
CcCCAACAGC 
*_******** 



TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
********** 



TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
********** 



200 

ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
ATGATTTGTA 
********** 



msa243324 . 2 { 275_A909 } 
msa243324.2{275_H36B} 
msa243324. 2 {275_090} 
msa243324 . 2 {275_18RS2l} 
msa243324.2{275_2603} 
msa243324 .2{275_CJB110} 
msa243324.2{275_COHl} 
msa243324.2(275_M732} 
msa243324.2{275_M78l} 
msa243324 . 2 {275_1169NT} 
msa243324.2{275_JM9130013} 
Consensus 



201 

TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 



TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 



********** ********** 



AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
AAGCTATTTT 
********** 



GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
********** 



250 

AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
********** 



msa243324.2(275_A909; 
msa243324 . 2 { 275_H36B, 
msa243324.2{275_090; 
msa243324.2{275_18RS2i; 

msa243324.2{275_2603. 
msa243324.2{275_CJB110; 
msa243324.2{275_COHi; 
ms a2 43 3 24 . 2 ( 2 7 5_M73 2 
msa2 43 3 24 . 2 { 27 5_M78 1 
msa243324 . 2 {275_1169NT 
msa243324.2{275_JM9130013 
Consensus 



msa243324 . 

msa243324 . 
msa243324 
msa243324.2{ 

msa243324 
msa243324.2{ 

msa243324 . 



2{275_A909 
2{275__H36B 
.2{275_090 
275_18RS21 
2{275_2603 
275_CJB110 
2{275_C0H1 



251 

CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 



TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 



AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 



TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 



300 

CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 
CAAAGGAGAA 



********** ********** ********** ********** ********** 



301 

TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 



AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 



AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 



TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 



350 

ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
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Table 63: Comparative Sequences relating to SAG1912 



msa243324.2{275_M732} 
msa243324.2{275_M78l} 
msa243324.2{275_1169NT} 
msa243324.2{275_JM9130013} 
Consensus 



msa243324.2{275_A909} 
msa243324.2{275_H36B} 
msa243324. 2 {275^090} 
msa243324 .2 {275_18RS2l} 
msa243324.2{275_2603] 
msa243324.2{275_CJB110} 
msa243324 . 2 { 275_COHl } 
msa243324 . 2 {275J4732 } 
msa243324 . 2 { 275__M781 } 
msa243324.2{275_1169NT} 
msa243324.2{275_JM9130013} 
Consensus 



msa243324 . 2 { 275_A909 } 
msa243324.2{275_H36B} 
rasa243324.2{275_090} 
msa243324 . 2 { 275_18RS21 } 
msa243324.2{275J2603} 
tnsa243324 . 2 { 275_CJB110 } 
msa243324.2{275_COHl} 
msa243324 . 2 { 275J4732 } 
msa243324.2{275_M78l} 
msa243324.2{275_1169NT} 
msa243324.2{275_JM9130013} 
Consensus 



msa243324.2{275_A909} 
msa243324 . 2{275_H36B} 
msa243324 . 2 { 27 5_090 } 
msa2 43324.2(27 5_1 8RS2 1 \ 
msa243324.2{275_2603> 
msa243324 . 2 { 275_CJB110 } 
msa243324 . 2{275_C0H1 1 
msa243324.2(275_M732 
msa243324.2{275_M781 
msa243324.2{275_1169NT] 
msa243324.2{275_JM9130013] 
Consensus 



msa243324 . 2[ 275_A909} 
msa243324.2{275_H36B| 
msa243324 . 2 { 275_090 
msa243324 . 2 { 275_18RS21 
msa243324 . 2 {275_2603 
msa243324 . 2 { 275_CJB110 
msa2 43324 . 2 { 275_COHl 
maa243324 . 2 { 275J4732 } 
msa243324.2{275_M78l} 
msa243324 . 2 { 275_1169NTl 
msa243324.2{275_JM9130013} 
Consensus 



TATAAAGGTA AATCTGTcCA AATGCCTACT TTAGAAGATG ATGGGAAAGG 
TATAAAGGTA AATCTGTcCA AATGCCTACT TTAGAAGATG ATGGGAAAGG 
TATAAAGGTA AATCTGTcCA AATGCCTACT TTAGAAGATG ATGGGAAAGG 
TATAAAGGTA AATCTGTtCA AATGCCTACT TTAGAAGATG ATGGGAAAGG 
********** *******-** ********** ********** ********** 



msa243324.2{275_A909} 
msa243324.2{275JH36B} 
msa243324.2{275__090> 
msa243324 . 2 { 275__18RS21> 
msa243324.2{275_2603} 
msa243324 . 2 { 275_CJB110 } 
msa243324.2(275_C0Hl} 
msa243324.2{275_M732} 
msa243324.2{275_M78l} 
msa243324 . 2 {275_1169NT} 
msa243324 .2{275_JM9X30013} 
Consensus 



351 

cAATATGACt 
cAATATGACt 
CAATATGACt 
cAATATGACt 
CAATATGACt 
CAATATGACt 
CAATATGACt 
CAATATGACt 
CAATATGACt 
cAATATGACt 
tAATATGACc 
_*******★- 

401 

CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
********** 

451 

GcTTGGAAAT 
GcTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
*_******** 

501 

AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
********** 

551 

TTGAAAc CTA 
TTGAAAcCTA 
TTGAAAc CTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAaCTA 



CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
********** 



CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
********** 



CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
********** 



TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
********** 



GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 



CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 
CTAGTCAAAA 



400 

AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
********** 

450 

GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 



********** ********** ********** 



CAAATACtTC 
CAAATACtTC 
CAAATACcTC 
CAAATACcTC 
CAAATACCTC 
CAAATACcTC 
CAAATACtTC 
CAAATACtTC 
CAAATACtTC 
CAAATACtTC 
CAAATACcTC 



TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 



GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 



*******_** ********** ********** 



GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 



CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 



TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 



500 

CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
********** 

550 

AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 



********** ********** ********** ********** 



CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
******_*** ********** 



582 

GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
********** ** 



SEQ ID NO. 6312 

SKTYDLYASVIiliAQAILESSSGQSDLSKAPNYNLFGIKGEYKGKSVQMPTLEDIXSI^NMT 
QI QAP FRAYPNYSAS LYDYAELVS SQKYAS VWKSNTS S YKDATAALTGLYATDTAYAS KL 
NQI I ETYSLDAYDK 



948 



WO 2004/018646 
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Table 63: Comparative Sequences relating to SAG1912 



SEQ ID NO. 6313 
STRAIN 090 frame: 1 . 

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDIiYASVLIiAQAILESSSGQ 
SDLSKAPimiLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV 
SSQKYASVWKSNTS S YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6314 
STRAIN A909 frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFI QTI SPTAI EI SKTYDLYASVLLAQAI LESS SGQ 
SDLSKAPNYNLFGI KGEYKGKSVQMPTLEDDGKGNMTQ I QAPFRAYPNYSASLYDYAELV 
SSQKYASAWKSNTSSYKDATAALTGLYATDTAYASKLNQI I ETYSLDAYDK 

SEQ ID NO. 6315 
STRAIN H36B frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFICyTISPTAIEI SKTYDLYASVLLAQAI LESSSGQ 
SDLSKAPNYNLFGI KGEYKGKS VQMPTLEDDGKGNMTQI QAPFRAYPNYSASLYD YAELV 
SS QKYASAWKSNTS S YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6316 
STRAIN 18RS21 frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFI QT I SPTAI E I SKTYDLYASVLLAQAI LESSSGQ 
SDLS KAPNYNLFG I KGEYKGKS VQMPTLEDDGKGNMTQI QAP FRAYPNY SAS LYD YAELV 
SSQKYASVWKSNTS S YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6317 

STRAIN M732 frame: I 

GVWFYNYKNDNVEPTWSASDQTTTFIGTI S PTAIEI SKTYDLYASVLLAQAI LESSSGQ 
SDLSKAPNYNLFGI KGEYKGKS VQMPTLEDDGKGNMTQI QAPFRAYPNYSASLYDYAELV 
SSQKYASVWKSNTS S YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6318 

STRAIN M78 1 frame: 1 

GVWF YNYKNDNVEPTVTSAS DQTTTF I QT I S PTA I E I SKTYDLYASVLLAQAILESSSGQ 
SDLSKAPNYNLFGI KGEYKGKSVQMPTLEDDGKGNMTQ I QAPFRAYPNYSASLYDYAELV 
S SQKYAS VWKSNTS S YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6319 
STRAIN CJB1 10 frame: 1 

GWFYNYKNDNVEPTVTSASDQTTTFI QTI SPTAIE I SKTYDLYASVLLAQAI LESSSGQ 
SDLSKAPNYNLFGI KGEYKGKSVQMPTLEDDGKGNMTQ I QAPFRAYPNYSASLYDYAELV 
S SQKYAS VWKSNTSS YKDATAALTGLYATDTAYAS KLNQ 1 1 ETYSLDAYDK 

SEQ ID NO. 6320 
STRAIN 1169NT frame: 1 

GVWF YNYKNDNVEQT VTSASDQTTTF I QT I S PTAI E I SKT YDLYAS VLLAQAI LE S S SGQ 
SDLSKAPNYNLFGI KGEYKGKS VQMPTLEDDGKGNMTQI QAP FRAYPNY SAS LYD YAELV 
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQI I ETYSLDAYDK 

SEQ ID NO. 6321 
STRAIN JM9 1300 13 frame: 3 

WFYNYKNDNVEPTVTS ASDQTTTF I QT I S PTAI E I S KTYDLYAS VLLAQAI LES S SGQS D 
L S KAPNYNLFG! KGEYKGKSVQMPTLEDDGKGNMTQ I QAPFRAYPNYS AS L YD YAELVSS 
QKYASVWKSNTSS YKDATAALTGLYATDTAYAS KLNQ 1 1 ENYSLDAYDK 

PRETTY of : /biotmp/msa243476 . 2 { * } February 11, 2003 05:17 

1 

msa243476.2{275_O90} gvWFYNY KNDNVEpTVT 

rasa243476.2{275_18RS2l} gvWFYNY KNDNVEpTVT 

msa2 43476 -2 {275^2603} mksrkkdklv lrltttllvf glggvWFYNY KNDNVEpTVT 

msa243476.2{275_CJB110} gvWFYNY KNDNVEpTVT 

msa243476.2(275_M732} gvWFYNY KNDNVEpTVT 

msa243476.2{275_M78l} gvWFYNY KNDNVEpTVT 

msa243476 . 2 {275_A909 } gvWFYNY KNDNVEpTVT 

msa243476.2{275_H36B} gvWFYNY KNDNVEpTVT 

msa243476.2{275_JM9130013} WFYNY KNDNVEpTVT 

msa243476.2{275 1169NT) gvWFYNY KNDNVEqTVT 

Consensus ********** ********** ***_-***** ******_*** 

51 

msa243476.2{275_090} QTI SPTAI EI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275 18RS21} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275 2603} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275 CJB110} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275_M732) QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

mBa243476.2{275_M78l} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275_A909} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275 H36B) QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275 JM9130013) QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

msa243476.2{275_1169NT} QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP 

Consensus ********** ********** ********** ********** 



50 

S ASDQTTTF I 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
SASDQTTTFI 
********** 

100 

NYNLFG I KGE 
NYNLFGIKGE 
NYNLFG I KGE 
NYNLFGIKGE 
NYNLFGIKGE 
NYNLFGIKGE 
NYNLFGIKGE 
NYNLFGIKGE 
NYNLFGIKGE 
NYNLFGIKGE 
********** 
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msa243476.2{275_090 
msa243476 . 2 { 275_18RS21 
msa243476 .2{275_2603 
msa243476.2{275_CJB110 
msa243476 . 2{275_M732 
msa243476.2{275_M78l) 
msa243476.2(275_A909} 
msa243476 .2{275_H36B} 
msa243476.2{275_JM9130013) 
msa243476 . 2 { 275_1169NT} 
Consensus 



101 

YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 



LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 



QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 



NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 



150 

ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 



********** ********** ********** ********** ********** 



msa243476 
msa243476.2{ 

msa243476 . 
msa243476 .2{ 

tnsa243476 . 

msa243476 . 

msa243476 . 

msa243476- 
msa243476.2{275 
msa243476.2{ 



2{275_090} 
275_18RS21} 
2{275_2603} 
275__CJB110} 
2f 275_M732} 
2f 275JM781) 
2{275_A909} 
2{275_H36B} 
_JM9130013} 
275_1169NT} 
Consensus 



151 

vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
aWKSNTSSYK 
aWKSNTSSYK 
vWKSNTSSYK 
VWKSNTSSYK 
_********* 



DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
********** 



ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
********** 



NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQI lEtYSLD 
NQIIEtYSLD 
NQIIEnYSLD 
NQIIEtYSLD 
*****„ **** 



194 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
AYDK 
**** 
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SEQ ID NO. 6401 
STRAIN 2603 

ATG AACAAGT CT AAG AAAAT CG AAAATT ATCAATTACTATTACTACAAG CGCAAG CT CTA 
TTCTCAGATGAAACAAATGCTCTTGCCAACTTATCAAATGCTTCAGCTATGCTAAATGCT 
ATG CTTC CAAATT CTGT ATTT ACAGGCTTTTATTT ATTTG ATGG AGAAG AGTTAATT CTT 
GGCCCITTCCAGGGTGGTGTATCATGTGTGC^TATTACTTTAGGAAAAGGTGTTTGTGGT 
GAAT CTG CACAAACTGCTAAGACG CTG AT CGTTGATGATGTTACAAAGCATG CT AACTAT 
ATCT C CTGTGATT CAAAAG CTATGAGTGAAAT CGTAGTAC CTATGTTTAAAAATGG CAAA 
CTTCT AGGAGTT CTAGATTTAGATT CTTCTTTAGT AG CAGATTATGATGAGATTG AT CAA 
GAATACTTAGAAAAATTTGTAGGTATTCTAGTAGAACATAaSATTTGGAAT^ 
TTTGGAGTTGAAAAG 

SEQ ID NO. 6402 
STRAIN 090 

CT CT ATT CTCAGATGAAACAAATG CTCTTG C CAACTT A 

T CAAATG CTT CAGCTATG CTAAATG CT ATG CTT C CAAATT CTGT ATTTAC 

AGGCTTTTATTTATTTGATGGAAAGGAGITAATTCOT 

GTGGTGTAT C&TGTGTG CAT ATTACTTTAGGAAAAGGTGTTTGTGGTGAA 

TCTCCACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCA 

TAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTACCTA 

TGTTT AAAAATGG CAAACTT CT AGGAGTT CTAGATTTAGATT CTT CTTTA 

GTAGCAGATTATGATGAGATTGATCAAGAATACITAGAAAAATTTGTAGG 

TATTCTAGTAGAACATACGATTTGGAATTTGGATA 

SEQ ID NO. 6403 

STRAIN A909 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAA 

CTTATCAAATGCTTCAG CTATGCTAAATGCTATG CTTCCAAATT CTGTAT 

TTACAGG CTTTT ATTTATTTGATGGAG AAGAGTTAATTCITG GC CC1"1T' C 

CAGGGTGGTGTATCATGTGTGCATATTACTTTAGGAA 

TGAAT CTGCACAAACTG CTAAGACGCTGAT CGTTGATGATGTTACAAAG C 

ATGCTAACTATATCT CCTGTGATT CAAAAG CTATGAGTGAAATCGTAGTA 

CCTATGTTTAAAAATGG CAAACTT CTAGGAGTT CT AG ATTTAGATTCTT C 

TTTAGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTG 

TAGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTT 

GAAAAG 

SEQ ID NO. - 6404 
STRAIN H36B 

CTCTATTCrCAGATGAAACAAATGCTCTTGC 

CAACTTATCAAATG CTT CAG CTATGCTAAaTG CTATG CTT CCAAATT CTG 

TATTTACAGG CTTTTATTT ATTTG ATGGAGAAGAGTTAATTCTTGGCC CT 

TTC CAGGGTGGTGT ATCATGTGTG CAT ATTACTTTAGGAAAAGGTGTTTG 

TGGTGAATCTGCACAAACTGCTAAGACGCTGATCGTTGATGATGTTACAA 

AGCATGCTAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTA 

GTAC CTATGTTTAAAAATGG CAAACTTCTAGGAGTTCTAGATT^ 

TT CTTTAGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAAT 

TTGTAGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGA 

GTTGAAAAG 

SEQ ID NO. 6405 
STRAIN 18RS21 

CTCTATTC!TCAGATGAAACAAATGCTCTTGCCAACT 

ATCAAATGCTTCAGCTATG CTAAATGC3TATGCTTCCAAATTCTGTATTTA 
CAGG CTTTTATTTATTTGATGGAGAAGAGTTAATT CTTGG CC CTTTC CAG 
GGTGGTGTATCATGTGTGCATATTACTTTAGG 

AT CTG CACAAACTGCTAAGACG CTGAT CGTTGATGATGTTACAAAGCATG 

CT AACTATAT CT C CTGTGATT CAAAAGCTATG AGTGAAAT CGTAGTACCT 

ATGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATCTAGAT^ 

AGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTGTAG 

GTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAA 

AAG 

SEQ ID NO. 6406 
STRAIN M732 

CTCTATTCTCAG ATG AAA.CAAATG CT CTTG CCAACTT 
ATCAAATGCTTCAGCTATG CTAAATG CTATGCTT C CAAATTCTGT ATTTA 
CAGGCTTTTATTTATCTGATGGAG AGGAGTTAATT (CT CAG 
GGTGGTGTATCATGTGTGCATACTACTTTAGGAAAAGGTGTTTGTGGTGA 
AT CTGCACAAACTG CTAAG ACG CTG ATTGTTGATGATGTTACAAAG CATG 
CT AACTATATCTCCTGTGATTCAAAAG CTATGAGTGAAAT CGTAGTACC C 
ATGTTTAAAAATGG CAAACTT CTAGGAGTT CTAGATTTAGATTC^ 
AGTAG CAGATTATGATG AG ATTGAT CAAG AATACTTAGAAAAATTTGTAG 
GTATT CT AGTAGAACAT AC GATCTGGAATTTGGATATGTTTGGAGTTGAA 
AAG 

SEQ ID NO. 6407 
STRAIN COH1 

CT CTATT CT CAGATGAAACAAATG CTCTTG C CAAC 

TTATCAAATGCTTCAGCTATGCTAAATGCTATGCTTCCAAATTCTGTATT 
TAC^GGCTTTTATTTATCTGATGGAGAGGAGTTAATTCTTGGC 
AGGGTGGTGT AT CAT GTGTG CATATT ACCTTAGGAAAAGGTGTTTGTGGT 
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GAATCTGCAC^^CTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCA 
TGCTAACTATATOTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTAC 
CCATGTrTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGATTCTTCT 
TT AGT AG GAG ATTATGATG AGATTG AT CAAGAAT ACTTAG AAAAATTTGT 
AGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTG 
AAAAG 



SEQ ID NO. 6408 
STRAIN M781 

CTCTATTCTCAGATGAAACAAATG CTCTTGCCAACTT 

AT CAAAT GCTTCAG CTATG CTAAATG CTATGCTT C CAAATTCTGTATTTA 

CAGG CTTTTATT TATTTGATGG AGAGG AGTTAATT CTTGG CC CTTTT CAG 

GGTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGA 

ATCTGCACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCATG 

CTAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTACCC 

ATGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGATTCTTCTTT 

AGTAG CAGATTATGATG AG ATTGAT CAAGAAT ACTTAGAAAAATTTGT AG 

GTATTOTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAA 

AAG 



SEQ ID NO. 6409 
STRAJNCJB110 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAACTTA 

TCAAATG CTTCAG CTATG CTAAATG CTATGCTTC CAAATTCTGT ATTTAC 

AGGCTTTTATTTATTTGATGGAAAGGAGTTAATTCre 

GTGGTGTAT CATGTGTG CATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 

TCTG CACAAACTGCTAAGACGCIX3ATTGTTGATC CATG C 

TAACTATAT CTCCTGTGATTCAAAAG CTATGAGTGAAAT CGTAGT ACCT A 

TGTTTAAAAATGGCAAACTTCTAGGAGTT CTAGATTTAGATT CTT CTTTA 

GT AGCAGATT ATGATGAGATTGAT CAAGAATACCTAGAAAAATTTGTAGG 

TATTCTAGTAGAACATACGATTTGGAA.TTTGGATATGTTTGGAGTTGAAA 

AG 



SEQ XD NO. 6410 
STRAIN 1169NT 

CT CTATTCT CAG ATGAAACAAATG CTCTTG C CAACTTA 

TCAAATG CTT CAG CTATGCTAAATG CTATG CTTC CAAATT CTGT ATTTAC 

AGGCTTTTATTTATTTGATG GAGAAGAGTTAATT CTTGG CCCTTTC CAGG 

GTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 

TCTG CAazy^CTGCT A^GACG CTGATTGTTGATGATGTTACAAAGCATG C 

TAACTATAT CT C CTGTG ATTCAAAAGCTATGAGTGAAATCGTAGTAC CCA 

TGTTTAAAAATGG CAAACTT CTAGGAGTT CTAGATTTAGATT CTT CTTTA 

GTAG CAGATTATGATGAGATTG AT CAAGAATACTTAGAAAAATTTGTAGG 

TATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAAA 

AG 



SEQ ID NO. 6411 
STRAIN JM9 1300 13 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAACTTA 

TCAAATG CTT CAG CTATG CTAAATG CTATG CITC CAAATTCTGT ATTTAC 

AGGCTTTTATTTATTTGATGGAGAAGAGTTAATT 

GTGGTGTAT CATGTGTG CATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 
TCn\3<^CAAACrGCTAAGACGCrGATCGTT 

TAACTATAT CT C CTGTGATT CAAAAGCTATGAGTGAAAT CGTAGTACCTA 

TGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTT 

GTAG CAGATTATGATGAGATTGAT CAAGAATACHTTAGAAAAATTTGTAGG 

TATTCTAGTAGAAGATACGATTTGGAATTTGGATATC 

AG 

PRETTY of: /biotmp/msa236796 . 2 { * } February 11, 2003 02:42 

1 50 

msa236796.2{282_COHl} 

msa236796.2{282_M732} 

msa236796.2{282_M78l} 

msa236796.2{282_090> . 

msa236796.2{282_CJB110} . 

msa236796.2{282_18RS2l} 

msa236796.2{282_2603} atgaacaagt ctaagaaaat cgaaaattat caattattat tactacaagc 

msa236796.2(282_A909} 

msa236796.2{282_H36B} 

msa236796.2{282_JM9130013} 

msa236796.2{282_1169NT} 

Consensus ********** ********** ********** ********** ********** 



51 100 

msa236796.2{282_COHl) CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TT AT CAAATG 

msa236796.2{282_M732} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa236796.2{282_M78l} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa2 36796. 2 {282^090} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa2 36796.2(282 CJB110) CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa236796.2{282_18RS21} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa2 36796. 2 {282J2603} gcaagCTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 
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tns a2 36796. 2 { 2 8 2_A9 0 9 } CTCTA TTCT CAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

ttisa236796 .2{282_H36B} CTCTA TTCT CAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa236796.2{282_JM9130013} CTCTA TTCT CAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

msa236796.2{282_1169NT} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG 

Consensus ********** ********** ********** ********** ********** 

101 150 

msa2 36796 .2(28 2_COHl} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa236796 ,2( 282_M732} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa236796.2{282_M78l} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa236796.2{282_090} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa2367 96.2{282_CJB110} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa236796.2{282_18RS2l} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa236796.2{282_2603} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

, msa2 3 67 9 6 . 2 { 2 8 2_A9 0 9 } CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

rasa23 6796.2{282_H36B) CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

ms a2 36796.2(28 2JJM9 130013} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

msa2367 96. 2 {282_1169NT} CTTCAGCTAT GCTAAATGCT ATGCTTCCAA ATTCTGTATT TACAGGCTTT 

Consensus ********** ********** ********** ********** ********** 



151 

msa2 36796.2(28 2_COHl } TATTTATTTG 

msa236796 .2(282_M732} TATTTATTTG 

msa236796.2(282_M78l} TATTTATTTG 

msa236796 .2{282_090} TATTTATTTG 

msa236796.2{282_CJB110) TATTTATTTG 

msa236796 .2(282_18RS2l} TATTTATTTG 

msa236796.2{282_2603} TATTTATTTG 

msa236796 .2(282_A90 9} TATTTATTTG 

msa236796 .2{282_H36B} TATTTATTTG 

ms a2 36796. 2(28 2_JM9 130013} TATTTATTTG 

msa236796.2(282_1169NT} TATTTATTTG 

Consensus ********** 



ATGGAgAgGA 
ATGGAgAgGA 
ATGGAgAgGA 
ATGGAaAgGA 
ATGGAaAgGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
*****_*_** 



GTTAATTCTT 
GTTAATT CTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
********** 



GGCCCTTTtC 
GGCCCTTTtC 
GGCCCTTTtC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTCC 
GGCCCTTTcC 
********_* 



200 

AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
********** 



msa2 36796.2(28 2_C0H1 
msa236796.2(282_M732| 
msa23 6796.2(282_M78i; 
msa236796.2(282_090' 
msa236796.2(282__CJB110" 
msa236796 .2(28 2_18RS21 ] 
msa236796.2{282_2603) 
msa236796.2{282_A9Q9) 
msa236796.2(282_H36B} 
msa236796 . 2{282_«JM9130013 } 
msa236796.2(282_1169NT} 
Consensus 



201 

ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 
ATCATGTGTG 



CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 
CATATTACTT 



********** ********** 



TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
TAGGAAAAGG 
********** 



TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
********** 



250 

GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
********** 



msa2 3 67 9 6 . 2 ( 2 8 2_COHl } 
msa23 6796 . 2 { 282_M73 2 ) 
ms a2 3 67 9 6 . 2 { 2 8 2_M7 8 1 } 
msa236796 . 2 (282_090 } 
msa236796.2(282_CJB110} 
msa236796.2{282_18RS2l) 
msa236796.2(282_2603) 
msa236796.2{282_A909} 
msa2 3 679 6 . 2 { 2 8 2_H3 6B * 
msa236796.2{282_JM9130013 
msa236796.2(282_1169NT 
Consensus 



251 

AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
********** 



GACGCTGATt 
GACGCTGATt 
GACGCTGATt 
GACGCTGATt 
GACGCTGATt 
GACGCTGATC 
GACGCTGATc 
GACGCTGATC 
GACGCTGATc 
GACGCTGATc 
GACGCTGATt 
*********_ 



GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 
GTTGATGATG 



TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 



300 

TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 
TGCTAACTAT 



********** ********** ********** 



msa236796. 
msa236796 . 
msa236796. 
msa236796 
msa236796.2{ 
msa236796.2( 
msa236796 
msa236796 
msa236796 
msa236796 .2(282 
msa236796.2{ 



2(282_COHl] 
2{282__M732 
2{282_M781^ 
.2(282_090' 
282_CJB110' 
282_18RS21] 
2(282_2603, 
2(282_A909] 
2{282_H36B] 
JM9130013 , 
282_1169NT] 
Consensus 



301 

ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
********** 



ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
********** 



TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
********** 



ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
********** 



350 

CcATGTTTAA 
CcATGTTTAA 
CcATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CtATGTTTAA 
CcATGTTTAA 
*_******** 



msa2 36796.2(28 2_COHl } 
msa236796.2(282_M732} 
msa236796.2(282_M78l| 
msa236796 .2(282_090) 
msa236796.2{282_CJB110j 
msa236796,2(282__18RS21) 



351 

AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 



CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 



TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 



AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 



400 

TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 



953 



WO 2004/018646 
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Table 64: Comparative Sequences relating to SAG 0827 



msa23 6796.2{282_2603> AAATGGCAAA CTTCTAGGAG 

msa236796 .2{282_A909} AAATGGCAAA CTTCTAGGAG 

msa236796.2.{282JH36B} AAATGGCAAA CTTCTAGGAG 

msa236796.2{282_JM9130013} AAATGGCAAA CTTCTAGGAG 

msa236796.2{282_1169NT} AAATGGCAAA CTTCTAGGAG 

Consensus ********** ********** 



TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 



AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 



TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 



********** ********** ********** 



msa236796 .2(282_COHl 
msa236796 .2{282_M732 
msa23 6796 . 2 { 282_M781 
msa236796 .2 {282JD90 
msa236796 .2 f 282_CJB110 , 
msa236796 .2 (282_18RS2l} 
msa23 6796 . 2 { 282 J2603 } 
msa23 6796 . 2 f 28 2_A909 * 
msa23 6796 . 2 { 282_H36B 
msa236796.2{282_JM9130013 
msa236796.2{282_1169NT} 
Consensus 



401 

ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
********** 



GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 



GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 



********** ********** 



AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
********** 



450 

AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
AGGTATTCTA 
********** 



msa236796 . 

msa236796 . 

msa236796 . 
msa236796 
msa2367 96.2{ 
msa2367 96.2{ 

msa236796 . 

msa236796 . 

msa236796 . 
msa236796.2{282 
msa236796.2{ 



2{282_COHl} 
2(282_M732} 
2{282_M781} 
,2{282_090} 
282_CJB110 } 
282_18RS2l} 
2{282_2603} 
2{282_A909} 
2{282_H36B} 
_JM9130013) 
282_1169NT} 
Consensus 



451 

GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
********** 



CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
********** 



TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATA — 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 



495 

tttggagttg aaaag 
tttggagttg aaaag 
tttggagttg aaaag 



********_ 



tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 



aaaag 
aaaag 
aaaag 
aaaag 
aaaag 
aaaag 
aaaag 



SEQ ZD NO. 6412 
STRAIN 2603 frame: 1 

mnkskki enyqlll.lqaqalfsdetnalanlsnasamlnamlpnsvftgfylfdgeel.il 
gp fqggv s cvh i tlgkg vcge s aqtaktl i vdd vt khany i s cd s kams e i wpm f kngk 

LLGVLDLDSSLVADYDE IDQEYLEKFVGI LVEHT I WNLDMFGVEK 

SEQ ID NO. 6413 

STRAIN 090 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGKEL I LG PFQGGVS CVH I TLGKG VC 
GESAQTAKTLIVDDVTKHANYI SCDSKAMSEIWPMFKNGKLLGVLDLDSSLVADYDEID 
QEYLEKFVGILVEHTIWNLD 

SEQ XD NO. 6414 

STRAIN A909 frame: 3 

LFSDETNAIJVNLSNASAMLNAMLPNSVFTGFYLFDGEEL I LGPFQGGVS CVH I TLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSEI WPMFKNGKLLGVLDLDSSLVADYDEID 
QEYLEKFVG I LVEHT I WNLDMFGVEK 

SEQ XD NO. 6415 
STRAIN H36B frame: 3 

LFSDETNALANL SMASAMLNAMLPNS VFTGFYLFDGEEL I LGPFQGGVS CVHI TLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSE I WPMFKNGKLLGVLDLDSSLVADYDE ID 
QEYLEKFVG I LVEHT I WNLDMFGVEK 

SEQ XD NO. 6416 
STRAIN 18RS21 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELIIiGPFQ^VSOmiTLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSEI WPMFKNGKLLGVLDLDSSLVADYDE ID 
QEYLEKFVG I LVEHT I WNLDMFGVEK 

SEQ ID NO. 6417 
STRAIN M732 frame: 3 

LFSDETNALANLSNASAMLNAMLPNS VFTGFYLFDGEELI LGPFQGGVS CVH I TLGKGVC 
GESAQTAKTLIVDDVTKHANYI SCDSKAMSE I WPMFKNGKLLGVLDLDSSLVADYDE ID 
QEYLEKFVG I LVEHT I WNLDMFGVEK 

SEQ XD NO. 6418 
STRAIN COH1 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELI LGPFQGGVSCVHI TLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSE I WPMFKNGKLLGVLDLDSSLVADYDE I D 
QEYLEKFVG I LVEHT I WNLDMFGVEK 

SEQ XD NO. 6419 
STRAIN M78 1 frame: 3 

LFSDETNALANLSNASAMLNAMLPNS VFTGFYLFDGEEL I LG PFQGGVS CVH I TLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSE I WPMFKNGKLLGVLDLDSSLVADYDE ID 



954 



WO 2004/018646 
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Table 64: Comparative Sequences relating to SAG 0827 



QEYLEKFVGI LVEHTI WNLDMPGVEK 

SEQ ID NO. 6420 

STRAIN M781 frame: 3 

LFSDETNALANL SNASAMLNAMLPNS VFTGFYLFDGEEL I LGP FQGGVSCVH ITLGKGVC 
GESAQTAKTL I VDDVTKHANYI SCDSKAMSEI WPMFKNGKLLGVLDLDSSLVADYDEI D 
QE YIiEKFVG I LVEHT I WNLDMFGVE K 

SEQ ID NO. 6421 
STRAIN CJB1 10 frame: 3 
LFSDETNALANL SNASAMLNAMLPNS 

GESAQTAKTL I VDDVTKHANYI SCDSKAMSEI WPMFKNGKLLGVLDLDSSLVADYDEI D 
QEYLEKFVGI LVEHTI WNLDMFGVEK 

SEQ ID NO. 6422 
STRAIN II69NT frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEEL I LGPFO^SGVSCVH ITLGKGVC 
GESAQTAKTLI VDDVTKHANYI SCDSKAMSEI WPMFKNGKLLGVLDLDSSLVADYDEID 
QEYLEKFVGI LVEHTI WNLDMFGVEK 

SEQ ID NO. 6423 
STRAIN JM9130013 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELI LGPFQGGVSCVH ITLGKGVC 
GESAQTAKTLIVDDVTKHANYI SCDSKAMSEI WPMFKNGKL 1X3 VLDLiDSSLVADYDEID 
QEYLEKFVGI LVEHT I WNLDMFGVEK 

PRETTY of: /biotmp/msa237960 . 2 { * } February 11, 2003 02:46 

. 1 50 
Tnsa237960 . 2 {282_1169NT} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960 .2 {282_18RS2l| L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960 .2{282_2603) mnkskkieny qllllqaqaL FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960.2(282_A909} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960 .2{282_COHl} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960.2{282_H36B) L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960.2{282_JM9130013} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960 .2{282_M732} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960 .2{282_M781> L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

rasa237960.2{282_090) L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

msa237960.2{282_CJB110} L FSDETNALAN LSNASAMLNA MLPNSVFTGF 

Consensus ********** ********** ********** ********** ********** 



msa237960.2{282_1169NT} 
msa237960 . 2 { 282_18RS21 } 
msa237960.2{282_2603} 
msa237960 . 2 { 282_A909 } 
msa237960 . 2 { 282_C0H1 } 
msa237960 . 2 { 282_H36B} 
msa237960.2{282_JM9130013} 
msa237960.2{282__M732) 
msa237960.2{282_M781} 
msa237960.2{282_090} 
msa237960.2{282_CJB110} 
Consensus 



51 

YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGeELIL 
YLFDGkELIL 
YLFDGkELIL 



GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 
GPFQGGVSCV 



*****_**** ********** 



HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
HITLGKGVCG 
********** 



ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
ESAQTAKTLI 
********** 



100 

VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
********** 



msa237960 .2 
msa237960 .2 
msa237960 . 
msa237960 
msa237960 
msa237960 
msa237960.2{282 
msa237960 
msa237960 
msa237960 
msa237960.2{ 



282_1169NT} 
282_18RS21} 
2{282_2603} 
2{282_A909} 
2(282_COHlj 
2{282_H36B) 
_JM9130013} 
2{282_M732} 
2{282_M781} 
.2{282_090} 
282_GJB110} 
Consensus 



101 

ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
ISCDSKAMSE 
********** 



IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
IWPMFKNGK 
********** 



LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 



LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 



********** ********** 



150 

EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
EYLEKFVGIL 
********** 



msa237960.2f 
msa237960.2{ 

msa237960 . 

msa237960 . 

msa237960 . 

msa237960 . 
msa237960.2{282 

msa237960 

msa237960 . 
msa237960 
msa237960.2{ 



282_1169NT} 
282_18RS21} 
2(282_2603} 
2f 282_A909} 
2l282_COHl> 
2{282_H36BJ 
!_JM9130013) 
2(282_M732) 
2{282_M781} 
.2{282_090j 
282_CJB110) 
Consensus 



151 

VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLDm 
VEHTIWNLD- 
VEHTIWNLDm 
*********_ 



165 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 

fgvek 



955 



WO 2004/018646 
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Table 65: Comparative Sequences relating to SAG0231 



SEQ ID NO. 6501 
STRAIN 2603 

ATGAAAAAGAGTACCCAAATAATACTACTAATAGTTGCA 

TT ATT CATAC TTGTTTTTAG CG GAGGATTTTATATGAAAGAACAACAAAG AAAAG AAGAA 
CT AAAACGG AAT CG AGAATATG AAGTT AGT CTAGT CAAAG CATTG AAAAATT C CTATGAG 
AATAT AG AAG AAAT AAAAAT CACACAT CCTGTTT CAACTGAAATT CCTGGAG ATTGG CAT 
TGTACTGTAAAGATTTCATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAAT 
TTGGAATCGAAAAAAAATTATAG C G GAAAATT TAATGAAAAAAATATGAATTTTT TTGAT 
TCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCAGGAGAAG 
ATACAA 

SEQ ID NO. 6502 
STRAIN 090 

GGAGGATTTTATATGAAAGAACA 

ACAAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAG 
TCAAAGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACA 
CATCCTGTTT CAACTGAAATT C CTGGAGATTG G CATTGT ACT GTAAAGAT 
TTCATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATT^ 
AATCGAAAAAAAATTATAGCGGAAATTTTAATGAAAAAAATATGAATTTT 
TTTGATT CAAGAATTGGTAAAACAAAAAAAACTATAAAAATT ATTTT1T C 
AGAt GG t CAGGAGAAGATa CAA 

SEQ ID NO. 6503 
STRAIN A909 

GGAGGATTTTATATGAAAGAACAACAA 

AGAAAAGAAG AACTAAAACGGAAT CGAGAATATGAAGTTAGT CTAGT CAA 
AG CATTGAAAAATTC CTATGAG AAT AT AGAAGAAAT AAAAAT CACACAT C 
CTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAAGA 
TTTAATGATAAAAAAT CTATTGTTTATAATATTACACATAATTTGGAATC 
GAAAAAAAATTMAGOSGAAAATTTAATGAAAAAAATA^ 
ATTCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTAaTT'TiT'CAGAT 
GG t CAGGAG AAG ATACAA 

SEQ ID NO. 6504 
STRAIN H36B 

GGAGGATTTTATATGAAAGAACA 

ACAAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAG 
TCAAAG CATTGAAAAATT C CTATG AGAATATAGAAGAAATAAAAATCACA 
CATCCTGTTT CAACTGAAATTCCTGGAGATTGGCATTGTA 
TTCATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGG 
AAT CGAAAAAAAATTATAG CGGAAAATTTAATGAAAAAAATATGAATTTT 
TTTGATT CAAGAATTGGTAAAACAAAAAAAACTATAAAAATTAt TTTTT C 
AGATGGt CAGGAGAAGATaCAA 

SEQ XD NO. 6505 
STRAIN 18RS21 

GGAGGATTTTATATGAAAGAACAAC 

AAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTC 
AAAG CATTG AAAAATTC CTATGAGAATATAGAAGAAATAAAAAT CACACA 
TCCTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAA 
CATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAA 
TCGAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTT 
TG ATT CAAG AATTGGTAAAACAAAAAAAACTATAAAAATTATT^ CAG 
ATGG t CAGGAGAAGATaCAA 

SEQ ID NO. 6506 
STRAIN M781 

GGAGGATTTTATATGAAAGAACAACAAAGAAAA 

GAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAAAGCATT 
GAAAAATTC CTATG AGAATATAGAAGAAATAAAAATCACACATC CTGTTT 
CAACTGAAATTC CTGGAGATTGG CATTGTACTGTAAAGATTT CATTTAAT 
GATAAAAAAT CT ATTGTTTATAATATTACACATAATTTGGAATCG AAAAA 
AAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTTT^ 
GAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCAG 
GAGAAGATACAA 

SEQ ID NO. 6507 
STRAIN CJBI10 

GGAGGATTTTATATGAAAGAACAACAAAGAAAAGAAGAA 
CTAAAACGGAATCGAGAATATGAAGTTAGT CTAGTCAAAG CATTGAAAAA 
TTCCTATGAGAATATAGAAGAAATAAAAATCACACATCCTGTTTCAACTG 
AAATT CCTGGAGATTGG CATTGTACTGTAAAGATTT CATTTAATG ATAAA 
AAAT CTATTGTTTATAATATTACACATAATTTGG AATCGAAAAAAAATT A 
TAGCGGAAATTTTAATGAAAAAAATATGAATTTTTTTGATTCAA 
GTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCAG 
ATACAA 

SEQ ID NO. 6508 
STRAIN I169NT 

GGAGGATTTTATATGAAAGAACAACAAAG 

AAAAG AAGAACTAAAACGGAAT CG AGAATATGAAGTTAGTCTAGT CAAAG 
CATTGAAAAATT CCTATGAGAATATAGAAG AAATAAAAATCACACAT CCT 
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GTTT CAACTGAAATT CCTGGAGATTGG CATTGTACTGTAAAG ATTT CATT 
TAAT GAT AAAAAAT CTATTGTTTAT AATATTACACATAATTTGGAAT CG A 
AAAAAAATT ATAGT G GAAAATT T AATG AAAAAAAT ATGA^TTTTTTT GAT 
TCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGG 
TCAGGAGAAGATACAA 

SEQ XD NO. 6509 
STRAIN JM9130013 
GGAGGATTTTATATGAAAGAACAAC 

AAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTC 
AAAG CATTGAAAAATTC CTATG AGAAT AT AGAAG AAATAAAAAT CACACA 
TCCTGTTTCAACTGA^TTCCTGGAGATTGGCATTGTACTGTAAAGATTT 
CATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAA 
T CGAAAAAAAATTATAG CGG AAAATTTAATGAAAAAAATATGAATTTTTT 
TG ATT CAAGAATTGGTAAAACAAAAAAiVVCTAT AAAAATT ATTTTTTCAG 
AtGGt CAGGAGAAG ATACAA 

PRETTY of: /biotmp/msa75400 .2 {*} March 10, 2003 09:56 

1 50 

msa75400 ,2{286_090} 

msa75400 . 2 { 286_CJB110 } 

msa75400.2{286_18RS2l} 

msa75400.2{286_2603} atgaaaaaga gtacccaaat aatactacta atagttgcat tattcatact 

msa7540Q .2f 286_A909 J 

msa75400.2{286_H36B} 

msa75400.2{286_JM9130013} 

msa75400.2{286_M78l) 

msa75400.2{286_1169NT} 

Consensus ********** ********** ********** ********** ********** 

51 100 

msa75400 . 2 { 286_090 } GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400 .2 {286_CJB110 } GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400.2{286_18RS2l} GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400.2{286_2603} tgtttttagc GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400 .2(286_A909 J GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400.2{286_H36B} GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400 . 2 {286_JM9130013 } GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400.2{286_M78l} GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

msa75400 .2{286_1169NT} GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC 

Consensus ********** ********** ********** ********** ********** 



101 150 

msa75400 .2{286_090) TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_CJB110} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_18RS2l} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_2603) TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

Tnsa75400.2(286_A909} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286__H36B} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_JM9130013} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_M78l) TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

msa75400.2{286_1169NT} TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT 

Consensus ********** ********** ********** ********** ********** 



151 200 

msa75400 .2{286_090} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa7 5400.2{286_CJB110} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400.2{286_18RS2l} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400.2{286_2603} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400 .2{286__A909} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400 .2{286_H36B} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400.2{286_JM9130013V TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400.2{286_M781} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

msa75400.2{286_1169NT} TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA 

Consensus ********** ********** ********** ********** ********** 

201 250 

msa75400 .2{286_090} AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400 .2{286_CJB110) AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400 .2{286_18RS2l} AATTCCTGGA GATTGG CATT GTACTGTAAA GA TTT CATTT AATGATAAAA 

msa75400.2{286_2603) AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400 .2{286_A909} AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400 .2{286_H36B) AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400.2{286_JM9130013) AATTCCTGGA GATTGG CATT GTACTGTAAA GATTTCATTT AATGATAAAA 

msa75400 .2 {286_M78l} AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA 

ma a7 54 00 . 2 { 286_1169NT} AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA 

Consensus ********** ********** ********** ********** ********** 



msa75400. 2 { 286_090} 
msa75400.2{286_CJB110} 
msa75400 . 2{286_18RS2l} 



251 300 
AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT 
AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT 
AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT 
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msa754 00. 2 { 286_2603} AATCTATTGT 

msa75400.2{286_A909} AATCTATTGT 

msa7 5400.2{286_H36B} AATCTATTGT 

msa75400.2{286_JM9130013} AATCTATTGT 

msa75400.2{286_M78l} AATCTATTGT 

msa75400.2{286_1169NT} AATCTATTGT 

Consensus ********** 



TTATAATATT 
TTATAATATT 
TTATAATATT 
TTATAATATT 
TTATAATATT 
TTATAATATT 



ACACATAATT 
ACACATAATT 
ACACATAATT 
ACACATAATT 
ACACATAATT 
ACACATAATT 



TGGAATCGAA 
TGGAATCGAA 
TGGAATCGAA 
TGGAATCGAA 
TGGAATCGAA 
TGGAATCGAA 



********** ********** ********** 



AAAAAATTAT 
AAAAAATTAT 
AAAAAATTAT 
AAAAAATTAT 
AAAAAATTAT 
AAAAAATTAT 
********** 



msa75400.2{286_090} 
msa754 00 . 2 { 286_CJB110 } 
msa75400 . 2 (286_18RS21 } 
rasa75400 .2{286_2603} 
msa75400.2{286_A909} 
msa75400.2{286_H36B} 
msa75400.2{286_JM9130013) 
nisa75400 .2{286_M78l} 
msa7 5400.2{286_1169NT} 
Consensus 



301 

AGcGGAAAtT 
AGcGGAAAtT 
AGcGGAAAaT 
AGcGGAAAaT 
AGcGGAAAaT 
AGcGGAAAaT 
AGcGGAAAaT 
AGcGGAAAaT 
AG t GGAAAaT 
**_*****_* 



TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
TTAATGAAAA 
********** 



AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
AAATATGAAT 
********** 



TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
TTTTTTGATT 
********** 



350 

CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
CAAGAATTGG 
* * * *'* ***** 



msa75400 .2{286_090} 
msa7 54 0 0 . 2 ( 2 8 6_C JB1 10 } 
msa75400.2(286_18RS2l' 
rasa75400.2{286_2603 
msa75400 .2{286_A909. 
msa75400.2{,286_H3SB} 
msa75400.2{286_JM9130013} 
msa75400.2{286_M78l} 
msa75400.2{286_1169NT} 
Consensus 



351 

TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
TAAAACAAAA 
********** 



AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 
AAAACTATAA 



AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 



TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 
TTCAGATGGT 



400 

CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 
CAGGAGAAGA 



********** ********** ********** ********** 



msa75400 
msa75400 .2{ 
msa75400.2{ 
msa75400 . 
msa75400 . 
msa75400 . 
msa75400.2{286 
msa75400 
msa75400.2{ 



2{286_090} 
286_CJB110} 
286_18RS2l) 
2{286_2603} 
2{286_A909' 
2{286_H36B 
__JM9130013 
2{286_M78l} 
286_1169NT} 
Consensus 



401 

TACAA 

TACAA 

TACAA 

TACAA 

TACAA 

TACAA 

TACAA 

TACAA 

TACAA 

***** 



SEQ ID NO. 6510 
STRAIN 2603 frame: 1 

MKKSTQI I LIj I VALF I LVFSGG FYMKEQQRKEELKRNRE YEVSLVKALKNS YEN I EE I KI 

THPVSTEIPGDWHCTVKISFNDKKSIVYNITHNLESKKNYSGKFNEKNM 

RTIKI IFSDGQEKIQ 

SEQ ID NO. 6511 
STRAIN 090 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNS YENI EE I KI THPVSTE I PGD 

WHCTVKI SFNDKB3 1 VYNITHNLESKKNYSGNFNEKNMNFro I FSDGQ 

EKIQ 

SEQ ID NO. 6512 
STRAIN A909 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNS YEN I EE I KI THPVSTE I PGD WH 
CIVKISFNDKKSI VYNITHNLESKKNYSGKFNEKNMN 

IQ 

SEQ ID NO. 6513 
STRAIN H36B 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNS YEN I EE I KI THPVSTE I PGD 

WHCTVK I S FND KKS I VYNI THNLE S KKNYSGKFNEKNMNFFD SRI GKTKKT I KI I FSDGQ 

EKIQ 

SEQ ID NO. 6514 
STRAIN 18RS21 

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEE I KI THPVSTE I PGDW 
HCTVKISFNDKKS I VYNITHNLESKKNYSGKFNEKNMNFFDSRI GKTKKT I FSDGQE 
KIQ 

SEQ ID NO. 6515 
STRAIN CJB1 10 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNSYEN I EE I KI THPVSTE I PGDWHCTVK 
ISFNDKKSIVYNITHNLESKKNYSGNFNEKNMNFFDSRIGKTKKTIKIIFSDGQEKIQ 

SEQ ID NO. 6516 
STRAIN JM9 1 300 1 3 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNSYEN I EE I KI THPVSTE I PGDW 
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HCIVKI SFNDKKS I VYNITHNLESKKNYSGKF^^ 
KIQ 



SEQ ID NO. 6517 
STRAIN 1169NT frame: 1 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNS YEN I EE I KITHPVSTE I PGDWHCTVKI S F 
NDKKS I VYNI THNLESKKNYSGKFNEKNMNFFDSR I GKTKKT I KI I FSDGQEKI Q 

SEQ XD NO. 6518 
STRAIN M781 frame: 1 

GGFYMKEQQRKEELKRNRE YEVSLVKALKNS YEN I EE I KITHPVSTE I PGDWHCTVKI SF 
NDKKS I VYNI THNLESKKNYSGKFNEKNMNFFDSRIGKTKKTIKI IFSDGQEKIQ 

PRETTY of : /biotmp/msa75376 . 2 { * } March 10, 2003 10:01 

1 

msa75376.2{286_090} — GGFYMKEQQR KEELKRNREY 

msa75376 . 2 ( 286_1169NT} GGFYMKEQQR KEELKRNREY 

msa75376.2{286_18RS2l} - GGFYMKEQQR KEELKRNREY 

msa75376 .2{286_2603} mkkstqiill ivalfilvfs GGFYMKEQQR KEELKRNREY 

maa75376 .2 {286_A909} GGFYMKEQQR KEELKRNREY 

msa75376 . 2 {286_CJB110 } GGFYMKEQQR KEELKRNREY 

msa75376 .2 {286_H36B} GGFYMKEQQR KEELKRNREY 

msa75376 . 2 (2 86_JM9130013 } - GGFYMKEQQR KEELKRNREY 

msa75376 . 2 {286_M78l} — ~ GGFYMKEQQR KEELKRNREY 

Consensus ********** ********** ********** ********** 



50 

EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
EVSLVKALKN 
********** 



msa75376.2{286_090} 
msa75376 . 2 1 286_1169NT} 
msa75376.2{286_18RS2l} 
msa75376.2{286_2603} 
msa75376 . 2{286_A909} 
msa75376 . 2 {286_CJB110 } 
msa75376 . 2 {286_H36B} 
msa75376 . 2 {286_JM9130013 } 
msa75376.2{286_M78l} 
Consensus 



51 

SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
SYENIEEIKI 
********** 



THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
THPVSTEIPG 
********** 



DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
DWHCTVKISF 
********** 



NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
NDKKS I VYNI 
********** 



100 

THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
THNLESKKNY 
********** 



msa75376.2{286__090} 
msa75376 . 2 { 286_1169NT} 
msa75376.2{286_18RS2l} 
msa75376 . 2 {286_2603 } 
msa75376 . 2 {286_A909} 
msa75376 . 2 {286_ > CJB110 } 
msa75376 . 2 {286_H36B} 
msa75376.2{286_JM9130013} 
msa75376.2{286_M78l} 
Consensus 



101 

SGnFNEKNMN 
SGkFNEKNMN 
SGkFNEKNMN 
SGkFNEKNMN 
SGkFNEKNMN 
SGnFNEKNMN 
SGkFNEKNMN 
SGkFNEKNMN 
SGkFNEKNMN 



FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 
FFDSRIGKTK 



**_******* ********** 



KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
KTIKIIFSDG 
********** 



135 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
QEKIQ 
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SEQ ID NO. 6601 
STRAIN 2603 

ttgacaagg cat ataaaaa.ttt ctatact aaatttacaaaatgaagg ag agggaactatg 
gaaatactg att gcaggtggtagtggttttttag g aaag cagat aataaaag cagcg ctt 
agaaaagggcataaagtggcttacttatcmgacatgaag'gtaaaggtgatatattta^g 

GAT CCTAGATTAAC CTACATTAGGG GAGATATTACAG AAG CTGATAAGATT CATTTAGAA 
GACAG AACTTTTGATATATT AATTG ACTGTATTGG AG CGATTAAG C C CAATCAACTAGAT 
GAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGTCACAAAAATCAAATACCA 
AAGTTAGTTTATATTTCAG CCAACAGCGG CT ATT CAG CTTACATTAAAAGTAAAAGG AAG 
GCAGAGCAGATAAT CAAAG CAAG CGGT CTGGATT ATCTTTTTGT AAGAC CAGGTTTGATG 
TATGGTGAAGAGCGACCTCTCTCGATTTTCCMGCC^GTGTATAAAGTTATTTAGTCAT 
TTGCCTTTCrrTAGGTATTGTTGTACAAAAGGTCTTTCCAACTAA 

GAAG CAAT CGTTACTAC GCTTAGG AAAAAAC CAAC CCAAAAAATC CTTT CTATTGAAGAA 
TTAAATAATAAA 

SEQ ID NO. 6602 
STRAIN 090 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAAT 
GAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTT 
AGGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTT 
ACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTA 
ACCTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGA 
CAGAACTTTTGATAT ATTAATTGACTGTATTGGAG CGATTAAGCC CAAT C 
AACTAGATGAGCTT AACGTTAAAG CAACC CAAAAAGCAGTAG CACTCTGT 
CACAAAAAT CAAATACCAAAGTTAGTTTATATTT CAG C CAACAG CGG CT A 
TTCAGCTTACATTAAAAGT AAAAGG AAGG CAGAG CAGATAAT CAAAG CAA 
GCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAGAG 
CGACCTCTCT CX5 ATTTT CCAAG CCAAGTGTATAAAGTTATTTAGTCATTT 
G CCITTCTTAGGTATTGTTGTACAAAAGGT CTTT C CAACTAAGGTTGTG A 
TAGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAAA 
AT CCTTT CT ATTGAAGAATTAAATAAT AAA 

SEQ ID NO. 6603 
STRAIN A909 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAATG 
AAGGAGAGGGAACTATGGAAATACTGATTG CAGGTGGTAGTGGTTTTTTA 
GGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTTA 
CTTATCLAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTAA 
CCTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGAC 
AGAACTTTTGAT ATATTAATTGACTGTATTGGAG CGATTAAGC CCAATCA 
ACTAGATGAG CTTAACGTT AAAGCAAC CCAAAAAGCAGTAG CACT CTGT C 
ACAAAAATCAAATACCAAAGTT AGTTT ATATTTC^ 

TCAG CTTACATTAAAAGTAAAAGGAAGG CAGAG CAGATAAT CAAAGCAAG 

CGGTCTCGATTATCTTTTTGTAAGACCAGGTTTGATG 

GAC CT CTCT CGATTTT C CAAGCCAAGTGTATAAAGTTATTTAGTCATTTG 

C CTTT CTTAGGTATTGTTGT ACAAAAGGT CTTTC CAACT 

AGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAA^ 

TC CITTCTATTGAAGAATTAAATAATAAA 

SEQ ID NO. 6604 
STRAIN H36B 

TATAAAAATTTCTATACTAAATTTACAAAATGAAGGAGAGGGAACTATGG 
AAATACTGATTGCAGGTGGTAGTGG'ITTTTTAGGAAAGCAaATAATAAAA 
GCAGCGCTTACAAAAGGGCATAAAGTGGCTTACT 

TAAAGGTGATATATTTAAGGATCCTAGATTAACCTACATTAGGGGAGATA 
TTACAGAAG CTGATAAGATT CATTT AGAAGACAG AACTTTTG AT ATATTA 
ATTGACTGTATTGGAGCGATTAAG C C CAATCAACTAGATGAG CTTAACGT 
TAAAGC!AACCCAAAAAGCAGTAGCACTCTGTCACAAAAATC^y^TACCAA 
AGTTAGTTTATATTTCAGCCAACAGCGGCTATTCAGCri^ 
AAAAGGAAGG CAGAG CAGATAATCAAAGCAAG CGGTCTGG ATTAT CTTTT 
TGTAAGACCAGGTTTGATGTATGGT GAAGAGCGAC CT CT CTCGATTTTCC 
AAG C CAAGTGTATAAAGTTATTTAGT CATTTG CCTTT CITAGGTATTGTT 
GTACAAAAGGTCTTTCCAACTAAGGTTGTGATAGTGGCAGAAGCAATCGT 
TACTACGCTTAG GAAAAAAC CAAC C CAAAAAATC CTTTCTATTG AAGAAT 
TAAATAATAAA 

SEQ ID NO. 6605 
STRAIN 18RS21 

ACAAGGCATAT AAAAATTT CTATACTAAATTTACAAAAT 

GAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTT 

AGGAAAGC^GATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTT 

ACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTA 

AC CTACATTAGGGGAGATATTAC^GAAG CTGATAAGATTCATTTAGAAG A 

CAGAACTTTTGATAT ATTAATTGACTGTATTGGAG CGATTAAG CC CAAT C 

AACTAGATGAGCTT AACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGT 

CACAAAAAT CAAATACCAAAGTTAGTTTATATTT CAG CCAACAG CGG CTA 

TTCAGCTTACATTAAAAGTAAAAGGAAGG CAGAGCAGATAATCAAAGCAA 

G CGGT CTGGATTATCTTTTTGT AAGACCAGGTTTG ATGTATGGTGAAGAG 

CGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCA 

G C CTTT'C"l"rAGGTATTGTTGTACAAA^GGTCTTT C CAACT AAGGTTGTGA 

TAGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAAA 

ATCCTTTCTATTGAAGAATTAAATaATAAA 



960 



WO 2004/018646 
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Table 66: Comparative Sequences relating to SAG 0754 



SEQ XD NO. 6606 
STRAIN M732 

CAAAATGAAGGAGAgGGAACTATGgAftATACTGATTGCAGGTGGTAGTGG 
TTTTCTAGGGAAGCAGATAATAAAAGCAG CGCTTACAAAAGGGCATAAGG 
TGGCTTACTTATCAAGGCATGAAGGTAAAGGTGATATATTTAAGGATCcT 
AGATTAAC CTTACATTAAGGGAG AT ATTACAGAAG CTGATAAG ATT CATTT 
AGaACATAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATTAAGC 
CCAAT CAACTAG ATGAG CTTAACGTTAAAG CAAC C CAAAAAG CAGTAGCA 
CT CTGT CACAAAAAT CAAATAC CAAAGTT AGT TTACATTT CAGC CAATAG 
CGGCTATTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCA 
AAG CAAG CGGTCTGGATTAT CTTTTTGTAAGAC CAGGTTTGATGTATGGT 

TCATTTGCCTTTCTTAGGTATTGTTGTACAAAAAGTCTTTCCAACTAAGG 
TTGTGATAGTGGCAGAAGCAATCGTTACTTCGCTTAGGAAAAAACCAACT 
CAAAAAATC CTTTCT ATTGAAGAATTAAAT AATAAA 

SEQ ID NO. 6607 
STRAIN COH1 

ACAAGG CATATAAAAATTT CTATACTAAATTTAC 

AAAATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGT 

TTT CT AGGGAAG CAG AT AATAAAAG CAG CG CTTACAAAAGGG CAT AAGGT 

GGCTTACTTATCAAGGCATGAAGGTAAAGGTGATATATTTAAGGATCCrA 

GATTAAC CTACATTAAGGGAGAT ATTACAGAAG CTGATAAGATT CATTT A 

GAAC^TAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATT 

CAATCAACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCAC 

TCTGTCACAAAAATCAAATACGAAAGTTAGTTTACATTTCAGC 

GG CT ATT CAG CTTACATTAAAAGTAAAAGGAAGG CAGAG CAGATAATCAA 

AGCAAGCGGT CTGGATTAT CTTTTTGTAAGAC CAGGTTTGATGT ATG GTG 

AAGAGCGAC CTCT CTCGATTTT CCAAG C CAAGTGTATAAAATTATTT AGT 

CATTTGCCTTTCTTAGGTATTGTTGTACAAA 

TGTG ATAGTGGCAGAAGCAATCGTTAC^T , CGCTTAGG AAAAAAC CAACT C 
AAAAAAT CCTTTCTATTGAAGAATTAAATAATAAA 

SEQ ID NO. 6608 
STRAIN M781 

ACAAGGCATATAAAAATTTcTATACTAAATTTaCA 

AAATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTT 

TTCTAGGGAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAGGTG 

GCTTACTTATCAAGG CATGAAGGTAAAGGTGATATATTTAAGGAT CCTAG 

ATTAACCTACATTAAGGGAGATATTACAGAAGCTGATAAGATTCATTTAG 

AACATAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCC 

AATCAACTAGATGAGCTTAACGTTAAAGCAA^ 

CTGTCACAAAAATCAAATACCAAAGTTAGTTTAC^ 

GCTATTCAG CTTACA.TTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAA 
GCAAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGA 
AGAGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAATTATTTAGTC 
ATTTGCCTTTCITAGGTATT7GTTGTACAAA 

GTGATAGTGGCAGAAGC!AATCGTTACTTCGCTTAGGAAAAAACa^CTCA 
AAAAATCCTTT CTA t TGAAGAATTAAATAATAAA 

SEQ ID NO. 6609 
STRAIN 1169NT 

AG^GG CATATAAAAATTT CTATACTAAATTTACAAA 

ATGAAGGAGAGGGAACTATGGAAATACTGATTGGAGGTGGTAGTGGTTTT 

TTAGGAAAG CAGATAATAAAAG CAG CG CTTACAAAAGGG CAT AAGTTGGC 

TTACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGAT 

TAAC CTACATTAAGGGAGATATTACAGAAG CTG ATAAGATTCATTTAGAA 

GACAG^CTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCT^ 

TC^CTAGATGAGCTTAACGTTAAAGC^CCaU^AAAGCAGTAGCACrCT 

GTCACAAAAATCAAATACCAAAGTTAGTTTACATTTCAGCCAAC^GCGGC 

TATTCAG CTT ACATTAGAAGTAAAAGG AA.GG CAG AGCAGATAATCAAAGC 

AAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAG 

AG CGACCTCT CT CG ATTTT C CAAGCCAAGTGTATAAAATTATTTAGT CAT 

TTGCCTTTCTTAGGTATTGTTGTACAAAAGGTCTTTCCAACTAAGGTTG 

GATAGTGGCAGAAGCAATCGTTACTACGCTTAGGACAAAACCAACTCAAA 

AAAT C CTTT CTATTG AAGAATTAAATAATAAA 

SEQ ID NO. 6610 
STRAIN CJB110 

ACAAGG CATATAAAAATTT CTATACT AAATTTACAAA 

ATaAAGGAGAGGGAACTATGGAAATACTGATTGC^GGTGGTAGTGGTTTT 

TTAGGAAAG CAGATAATAAAAG CAG CG CTTACAAAAGGG CAT AAAGTGGC 

TTACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGAT 

TAACCTACATTAG<3GGAGATATTACAGAAGCTGATAAGATTCATTTAGAA 

GACAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAA 

TCAACTAGATGAGCTTAACGTTAAAGCAAC CCAAAAAGCAGTAG CACTCT 

GTCACAAAAATCAAATACCAAAGTTAGTTTATATT^ 

TATTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGC 

AAGCGGTCTGGATTATCTT ; lT u rGTAAGACCAGGTTTGATGTATGGTGAAG 

AG CGACCTCT CT CGATTTT C CAAG C CAAGTGTATAAAGTTATTT AGT CAT 

TTGCCTTTCTTAGGTATTGiTTGTACAAAA 

GATAGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAA 
AAATCCTTT CTATTGAAGAATTAAATAATAAA 



961 
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PCT/US2003/026827 



Table 66: Comparative Sequences relating to SAG 0754 



SEQ XD NO. 6611 
STRAIN JM9130013 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAATG 
AAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTTA 
GGAAAGCAGATAAT AAAAG CAG CG CTTACAAAAGGG CAT AAAGTGG CTTA 
CTTAT CAAG ACATG AAGGTAAAGGTGATAT ATTTAAGGAT CCTAG ATTAA 
CcTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGAC 
AGAACTTTTGATAT ATTAATTGACTGTATTGGAGCGATTAAG C CCAATCA 
ACTAGATGAG CTTAACGTT AA^G CAAC CCAAAAAG CAGT AG CACT CTGTC 
ACAAA^ATCAAATAC CAAAGTTAGTTT ATATTTCAG C CAACAGCGG CTAT 
TGAG CTTACATTAAAAGTAAAAGGAAGG CAGAG CAGATAATCAAAGCAAG 
CGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAGAGC 
GACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCATTTG 
CCtTTCTTAgGTATTGTTGTACAAAAGGTCTTTCCAACTAAGGTTGTGAT 
AGTGG CAGAAG CAAT CGTTACT ACG CTTAGGAAAAAACCAAC C CAAAAAA 
TC CTTTCTATTGAAGAATTAAATAATAAA 

PRETTY of; /biotmp/msal37119 . 2 { * } April 10, 2003 03:30 



msal3 7119 .2(3 03_COH1 } 
msal37119.2{303_M732} 
msal37119.2{303_m78l} 
msal37119 .2{303_090} 
msal37119.2{303_18RS2l} 
msa!37119 . 2 { 303_2603 } 
msal37119.2{303_A909j 
rasal37119.2{303_CJB110} 
msal37119.2{303_H36B} 
msal37119.2{303_JM9130013} 
msal37119 . 2 {303_1169NT} 
Consensus 



acaaggc atataaaaat ttctatacta 



acaaggc 

acaaggc 

acaaggc 

ttgacaaggc 

acaaggc 

acaaggc 

acaaggc 

acaaggc 



atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
-tataaaaat 
atataaaaat 
atataaaaat 



ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 



aatttaCAAA 

CAAA 

aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
***★ 



50 

ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
ATGAAGGAGA 
********** 



51 100 

msal37119.2{303_COHl} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT cTAGGgAAGC 

msal37119.2{303_M732} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT cTAGGgAAGC 

msal37119.2{303_m78l} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT cTAGGgAAGC 

msa!37119 .2{303_090} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_18RS2l} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119 .2{303_2603> GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_A909} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_CJB110} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_H36B} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_JM9130013} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

msal37119.2{303_1169NT} GGGAACTATG GAAATACTGA TTGCAGGTGG TAGTGGTTTT tTAGGaAAGC 

Consensus ********** ********** ********** ********** _****_**** 



msal37119 
msal37119 
msal37119 
msal37119 

msal37119.2{ 
msal37119 
msal37119 

msal37119.2{ 
msal37119. 
msal37119.2{303 

msa!37119.2{ 



2{303_COHl} 
2{303_M732} 
2{303_m78l} 
2{303_090} 
303_18RS21} 
2 {303^2603} 
2{303_A909> 
303_C*JB110) 
2{303__H36B} 
_JM9130013} 
303__1169NT} 
Consensus 



101 

AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
AGATAATAAA 
********** 



AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
AGCAGCGCTT 
********** 



ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
ACAAAAGGGC 
********** 



ATAAggTGGC 
ATAAggTGGC 
ATAAggTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAagTGGC 
ATAAgtTGGC 
****_ _***★ 



150 

TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
TTACTTATCA 
********** 



msal37119 . 2 { 303_C0H1 J 
msal37119 . 2 {303_M732 ] 
msal37119 . 2 { 303_m781 ] 
msal37119.2{303_090] 
msal37119.2{303_18RS2l] 
msal37119 . 2 { 303_2603 ] 
msal37119.2{303_A909; 
msal37119.2{303_CJB110l 
msal37119 . 2 { 303_H36B 
msal37119 . 2 {303_JM9130013 ) 
msal37119.2{303_1169NT] 
Consensus 



151 

AGg CATGAAG 
AGgCATGAAG 
AGg CATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 



GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 



**_******* ********** 



TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
TATATTTAAG 
********** 



GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 
GATCCTAGAT 



200 

TAA CCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 
TAACCTACAT 



********** ********** 



msal3 71 1 9 . 2 { 3 03_COH1 } 
msal37119 . 2(303_M732} 
msal37119.2{303_m781} 
msal37119.2{303_090} 
msal37119.2{303_18RS2l} 
msal37119 . 2 {303_2603 } 



201 

TAaGGGAGAT 
TAaGGGAGAT 
TAaGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 



ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 



CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 



TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 



250 

cAtAGAAaTT 
cAtAGAAaTT 
CAtAGAAaTT 
gAcAGAAcTT 
gAcAGAAcTT 
gAcAGAAcTT 
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Table 66: Comparative Sequences relating to SAG 0754 



msal37119.2{303_A909} 
msal37119 . 2 { 303_CJB110 } 
msal37119 .2{303_H36B} 
msal37119 . 2 { 303_JM9130013 } 
msal37119 . 2 {303_1169NT} 
Consensus 



TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAaGGGAGAT 



ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 



**_******* ********** 



CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
********** 



TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 



gAcAGAAcTT 
gAcAGAAcTT 
gAcAGAAcTT 
gAcAGAAcTT 
gAcAGAAcTT 



********** _★_****_** 



msal37119.2{303_COHl) 
msal37119 . 2 {303_M732 * 
msal37119 . 2 {303_m781 : 
msal37119 . 2 {303_090 [ 
msal37119.2{303_18RS2lj 
msal37119.2{303_2603 i 
msal37119.2{303_A909; 
msal37119.2{303_CJB110l 
msal37119.2{303_H36B} 
msal37119 . 2 {303_JM9130013 } 
msal37119 . 2 {303_1169NT} 
Consensus 



251 

TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 



AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 



ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 



TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 



********** ********** ********** ********** 



300 

TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
********** 



301 350 

msal37119.2{303_COHl} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_M732} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119 .2{303_m78l} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_090} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119 . 2 { 303_18RS21 } GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_2603) GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_A909} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_CJB110} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_H36B} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119.2{303_JM9130013) GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

msal37119 .2{303_1169NT} GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA 

Consensus ********** ********** ********** ********** ********** 



351 

msal3 7119.2{303_COHl} TCAAATACCA AAGTTAGTTT 

msal37119 - 2{303_M732 } TCAAATACCA AAGTTAGTTT 

msal37119.2{3 03_m781> TCAAATACCA AAGTTAGTTT 

msal37119.2{303_090} TCAAATACCA AAGTTAGTTT 

msal37119 .2{303_18RS2l} TCAAATACCA AAGTTAGTTT 

msal37119 .2{303_2603} TCAAATACCA AAGTTAGTTT 

msal37119.2{303_A909 } TCAAATACCA AAGTTAGTTT 

msal37119.2{303_CJB110} TCAAATACCA AAGTTAGTTT 

msal37119.2{303_H36B} TCAAATACCA AAGTTAGTTT 

msa!37119 .2{303_JM9130013} TCAAATACCA AAGTTAGTTT 

msal37119.2{303_1169NT} TCAAATACCA AAGTTAGTTT 

Consensus ********** ********** 



AcATTTCAGC 
AcATTTCAGC 
AcATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AtATTTCAGC 
AcATTTCAGC 



CAAtAGCGGC 
CAAtAGCGGC 
CAAtAGCGGC 
CAAcAGCGGC 
CAAcAGCGGC 
CAAcAGCGGC 
CAAcAGCGGC 
CAAcAGCGGC 
CAACAGCGGC 
CAAcAGCGGC 
CAAcAGCGGC 



*„******** ***_****** 



msal37119. 

msa!37119. 

msa!37119. 
msal37119 
msal37119.2{ 

msal37119. 

msal37119. 
msal37119.2{ 

tnsal37119 
msal37119.2{303 
msal37119.2{ 



2f303_COHl} 
2{303_M732) 
2{303_m78l} 
2{303_090} 
303_18RS2ll 
2{303_2603> 
2{303_A909} 
303_CJB110} 
2{303_H36B) 
JM9130013) 
303_1169NT} 
Consensus 



401 

ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAaAAG 
ACATTAgAAG 



TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 
TAAAAGGAAG 



******_*** ********** 



msal37119.2{303_COHl} 
msal37119.2(303_M732j 
msal37119 . 2 { 303_m78 1 , 
msal37119.2{303__090; 
msal37119.2{303_18RS2i; 
msal37119 . 2{303_2603 ' 
msal37119.2{303_A909l 
msal37119 . 2 { 303_CJB110 } 
ltisal37119 . 2 { 303_H36B} 
msal37119 . 2 {303_JM9130013 J 
msal37119 . 2{303_1169NT} 
Consensus 



msal37119.2{303_COHl} 
msal37119.2{303__M732} 
tnsal37119.2{303_m78l} 
msal37119.2{303_090) 
msal37119.2{303__18RS2l} 



451 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT ■ 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

GATTATCTTT 

********** 

501 

CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 



TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
TTGTAAGACC 
********** 



CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 



400 

TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
TATTCAGCTT 
********** 

450 

AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
AAGCGGTCTG 
********** 

500 

AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
AGCGACCTCT 
********** ********** ********** 



GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
GCAGAGCAGA 
********** 



AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 
AGGTTTGATG 



TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
TAATCAAAGC 
********** 



TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 
TATGGTGAAG 



GTATAAAaTT 
GTATAAAaTT 
GTATAAAaTT 
GTATAAAgTT 
GTATAAAgTT 



ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 



550 

TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 



963 



WO 2004/018646 
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Table 66: Comparative Sequences relating to SAG 0754 



msal37119 .2{303_2603} 
msal37119 .2{303_A9Q9} 
msa!37119 . 2 { 303_CJB110 } 
msal37119.2{303_H36B} 
msal37119 . 2 {303_JM9130013 } 
msal37119 . 2 {303_1169NT} 
Consensus 



CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 
CTCGATTTTC 



CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 
CAAGCCAAGT 



GTATAAAgTT 
GTATAAAgTT 
GTATAAAgTT 
GTATAAAgTT 
GTATAAAgTT 
GTATAAAaTT 



ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 
ATTTAGTCAT 



TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 
TTGCCTTTCT 



********** ********** *******_** ********** ********** 



551 

rasal37119 .2{303_COHl} TAGGTATTGT 

msal37119 .2{303__M732} TAGGTATTGT 

msal37119.2{303__m78l) TAGGTATTGT 

msal37119.2{303_090} TAGGTATTGT 

msal37119.2{303__18RS2l} TAGGTATTGT 

msal37119. 2 {303J2603} TAGGTATTGT 

msal37119.2{303_A909} TAGGTATTGT 

msal37119.2{303_CJB110} TAGGTATTGT 

msal37119.2{303_H36B} TAGGTATTGT 

msal3 7 1 19 . 2 { 3 03_JM9 13 0 013} TAGGTATTGT 

msal37119.2{303_1169NT} TAGGTATTGT 

Consensus ********** 



TGTACAAAAa 
TGTACAAAAa 
TGTACAAAAa 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 
TGTACAAAAg 



GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 
GTCTTTCCAA 



*********_ ********** 



CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
CTAAGGTTGT 
********** 



600 

GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
GATAGTGGCA 
********** 



601 650 

msal37119.2(303_COHl} GAAGCAATCG TTACTt CGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC 

msal37119.2{303_M732} GAAGCAATCG TTACTt CGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC 

msal37119 . 2{3Q3__m78l} GAAGCAATCG TTACTt CGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC 

msal37119 .2 {303_090 } GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303_18RS2l) GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303J2603) GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303_A909} GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303__CJB110} GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303_H36B) GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{30 3_JM9 130013} GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC 

msal37119.2{303__1169NT} GAAGCAATCG TTACTaCGCT TAGGAcAAAA CCAACtCAAA AAATCCTTTC 

Consensus ********** *****_**** *****„**** *****_**** ********** 



msal37119.2{303_COHl} 
msal37119.2{303_M732} 
msal37119.2{303_m781} 
msal37119.2{303_090} 
msal37119.2{303_18RS2l} 
msal37119.2(303_2603} 
rasal37119.2{303_A909) 
msal37119 . 2{303_CJB110 ) 
tnsal37119.2{303_H36B} 
msal37119.2{303_JM9130013} 
msal37119.2{303_1169NT} 
Consensus 



651 672 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
TATTGAAGAA TTAAATAATA AA 
********** ********** ** 



SEQ ZD MO. 6612 
STRAIN 2603 frame: 1 

TRHI KI S I LNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKVAYLSRHEGKGDI FKD 
PRLT Y IRGD I TEADKI HLEDRTFD I L I DC I GAI KPNQLDELNVKATQKAVALCHKNQ I PK 
LVYI SANSGY SAYI KSKRKAEQ 1 1 KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK 

SEQ ID NO. 6613 

STRAIN 090 frame: 1 

TRHI KI S I LNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKVAYLSRHEGKGDI FKD 
PRLT Y I RGD I TEADKI HLEDRTFD I LI DCIGAI KPNQLDELNVKATQKAVALCHKNQ I PK 
LVYI SANSGY SAY I KSKRKAEQ 1 1 KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLG I WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

SEQ ID NO. 6614 
STRAIN A909 frame: 1 

TRHI KI S I LNLQNEGEGTME ILIAGGSGFLGKQI I KAALTKGHKVAYLSRHEGKGDI FKD 
PRLTYIRGD ITEADKI HLEDRTFD I LI DCIGAI KPNQLDELNVKATQKAVALCHKNQ I PK 
LVYI SANSGYSAYI KSKRKAEQ 1 1 KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLGI WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

SEQ ID NO. 6615 

STRAIN H36B frame: 2 

IKI S I LNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKVAYLSRHEGKGDI FKDPRL 
TY I RGD I TEADKI HLEDRTFDI L I DCI GAI KPNQLDELNVKATQKAVALCHKNQ I PKLVY 
I SANSGYSAYI KSKRKAEQ 1 1 KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHLPFL 
GI WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

SEQ ID NO. 6616 
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STRAIN 18RS21 frame: 1 

TEH I KIS ILNLQNEGEGTME ILIAGGSGFLGKQI I KAALTKGHKVAYLSRHEGKGDI FKD 
PRLTYIRGD I TEADKIHLEDRTFDI LI DCIGAIKPNQLDELNVKATQKAVALCHKNQI PK 
LVYI SANSGYSAYIKSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLG I WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

SEQ ID NO. 6617 

STRAIN M732 frame: 1 

QNEGEGTME I L I AGGSG FLGKQ 1 I KAALTKGHKVAYLSRHEGKGD I FKDPRLTY I KGD I T 
EADKI HLEHRNFD I LIDCIGAI KPNQLDELNVKATQKAVALCHKNQI PKLVYI SANSGYS 
AYIKSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHLPFLGIWQKVF 
PTKWIVAEAIVTSLRKKPTQKI LSI EELNNK 

SEQ ID NO. 6618 

STRAIN COH1 frame: 1 

TRHI KI S I LNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKVAYLSRHEGKGDI FKD 
PRLTYIKGD I TEADKIHLEHRNFDILI DC IGAI KPNQLDELNVKATQKAVALCHKNQI PK 
LVYI SANSGYSAYIKSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLGI WQKVFPTKWI VAEAI VTSLRKKPTQKI LSI EELNNK 

SEQ ID NO. 6619 
STRAIN M781 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD 
PRLTYIKGD ITEADKIHLEHRNFD I LI DC I GAI KPNQLDELNVKATQKAVALCHKNQI PK 
LVYI SANSGYSAYI KSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCIKLFSHL 
PFLGI WQKVFPTKWI VAEAI VTSLRKKPTQKI LSI EELNNK 

SEQ ID NO. 6620 
STRAIN 1169NT frame: 1 

TRHI KI S ILNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKLAYLSRHEGKGD I FKD 
PRLTYIKGD ITEADKI HLEDRTFD I L I DC I GAI KPNQLDELNVKATQKAVALCHKNQ I PK 
LVYI SANSGYSAYI RS KRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCI KLFSHL 
PFLGI WQKVFPTKWI VAEAI VTTLRTKPTQKI LS I EELNNK 

SEQ ID NO. 6621 

STRAIN CJB1 10 frame: 1 

TRHI KI S I LNLQNEGEGTME I L I AGGS GFLGKQ 1 1 KAALTKGHKVAYLSRHEGKGD I FKD 
PRLTYIRGD I TEADKIHLEDRTFDI LIDCIGAI KPNQLDELNVKATQKAVALCHKNQI PK 
LVYI SANSGYSAYI KSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCIKLFSHL 
PFLGI WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

SEQ ID NO. 6622 

STRAIN JM91 30013 frame: 1 

TRHI KIS ILNLQNEGEGTME I L I AGGSGFLGKQ 1 1 KAALTKGHKVAYLS RHEGKGD I FKD 
PRLTYI RGD I TEADKI HLEDRTFD I L I DC I GA I KPNQLDELNVKATQKAVALCHKNQ I PK 
LVYI SANSGYSAYI KSKRKAEQI I KASGLDYLFVRPGLMYGEERPLS I FQAKCIKLFSHL 
PFLGI WQKVFPTKWI VAEAI VTTLRKKPTQKI LS I EELNNK 

PRETTY of: /biotmp/msal37299 . 2 { * } April 10 # 2003 03:37 



msal37299. 

msal37299. 

msal37299 . 
msal37299 
meal372?9.2{ 

msal37299. 

msal37299 
msal37299.2{ 
msal37299.2{303 

msal37299. 
msal37299.2{ 



2{303_COH1} 
2(303_M732) 
2{303_M781} 
2{303_090} 
303_18RS2l} 
2{303_2603) 
2{303_A909) 
303_CJB110} 
_JM9130013} 
2{303_H36B) 
303_1169NT} 
Consensus 



msal37299. 

msal37299. 

msal37299. 
msal37299 
msal37299.2{ 

msal37299. 

msa!37299. 
msal37299.2{ 
msal37299.2{303 

msal37299 . 
msa!37299.2{ 



2{303_C0H1} 
2{303_M732} 
2{303_M78l} 
2{303_090} 
303_18RS2l} 
2{303__2603} 
2{303_A909} 
303_CJB110} 
_JM9130013} 
2{303_H36B} 
303_1169NT} 
Consensus 



trhikisiln 

trhikisiln 
trhikisiln 
trhikisiln 
trhikisiln 
trhikisiln 
trhikisiln 
trhikisiln 

ikisiln 

trhikisiln 



51 

HEGKGDI FKD 
HEGKGDIFKD 
HEGKGDI FKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
HEGKGDIFKD 
********** 



1 QNEGEGTME 
- QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
1 QNEGEGTME 
_ ********* 



PRLTYIkGDI 
PRLTYI kGDI 
PRLTYIkGDI 
PRLTYI rGD I 
PRLTYI rGDI 
PRLTYI rGD I 
PRLTYI rGDI 
PRLTYI rGDI 
PRLTYI rGDI 
PRLTYI rGDI 
PRLTYIkGDI 
******_*** 



ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
ILIAGGSGFL 
********** 



TEADKI HLEh 
TEADKI HLEh 
TEADKIHLEh 
TEADKIHLEd 
TEADKI HLEd 
TEADKIHLEd 
TEADKIHLEd 
TEADKIHLEd 
TEADKIHLEd 
TEADKIHLEd 
TEADKIHLEd 
*********_ 



GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
GKQIIKAALT 
********** 



RnFDILIDCI 
RnFDILIDCI 
RnFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
RtFDILIDCI 
*_******** 



50 

KGHKvAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHKVAYLSR 
KGHKvAYLSR 
KGHKvAYLSR 
KGHK1AYLSR 
****_***** 

100 

GAI KPNQLDE 
GAIKPNQLDE 
GAI KPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
GAIKPNQLDE 
********** 
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msal37299 .2{303_COH1} 
msal37299.2{303_M732} 
msal37299 . 2 { 303_M78 1 } 
msal37299.2{303_090) 
msal37299.2{303_18RS2l} 
msal37299 . 2 { 303_2603 } 
msa!37299.2{303_A909} 
rasa!37299 . 2 { 303_CJB110 } 
msal37299 . 2{303_JM9130013 } 
msal37299 . 2 { 303_H35B} 
msal37299.2{303_1169NT} 
Consensus 



101 

LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 
LNVKATQKAV 



ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 
ALCHKNQIPK 



********** ********** 



LVYISANSGY 
LVYI SANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
LVYISANSGY 
********** 



SAY I kS KRKA 
SAY I kS KRKA 
SAY I kS KRKA 
SAY IkS KRKA 
SAY IkS KRKA 
SAY IkS KRKA 
SAY I kS KRKA 
SAY I kS KRKA 
SAYIkSKRKA 
SAY I kSKRKA 
SAYIrSKRKA 



150 

EQIIKASGLD 
EQI IKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 
EQIIKASGLD 



****_***** ********** 



151 

msal37299.2(303_COHl} YLFVRPGLMY 

msal37299.2{303_M732} YLFVRPGLMY 

msal37299 . 2{303_M78l} YLFVRPGLMY 

msal37299.2{303_090} YLFVRPGLMY 

msal37299.2{303_18RS2l} YLFVRPGLMY 

msal37299.2{303_2603} YLFVRPGLMY 

msal37299.2{303_A909} YLFVRPGLMY 

msal37299.2{303_CJB110} YLFVRPGLMY 

msal37299.2{30 3_JM9 1 3 0 0 1 3 J YLFVRPGLMY 

msal37299 .2{303_H36B) YLFVRPGLMY 

msal37299.2{303_1169NT} YLFVRPGLMY 

Consensus ********** 



GEERPLSIFQ 
GEERPLS I FQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 
GEERPLSIFQ 



AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 
AKCI KLFSHL 



PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 
PFLGIWQKV 



200 

FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 
FPTKWIVAE 



********** ********** ********** ********** 



msal37299.2{303_COHl} 
msal3 72 9 9 . 2 f 3 03_M73 2 } 
msal37299 . 2 { 303_M78l} 
msal37299.2{303_090} 
msal37299 . 2 { 303_18RS21} 
msal37299.2{303_2603} 
msal37299 . 2 { 303_A909} 
msal37299 . 2 {303_CJB110 } 
msal37299.2{303_JM9130013} 
msal37299.2{303__H36B) 
msal37299.2{303_1169NT} 
Consensus 



201 223 
AIVTsLRkKP TQKILSIEEL NNK 
AIVTsLRkKP TQKILSIEEL NNK 
AIVTsLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRkKP TQKILSIEEL NNK 
AIVTtLRtKP TQKILSIEEL NNK 
****„**_** ********** *** 
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SEQ ID NO. 6701 
STRAIN 090 

CAAT AACAACATTTG AAAAT AAAAAAGTT TTAGTC CTTG GT^^ 

TCTGGAGAAGCCGCTGCACGTTTGTTAGCTAAGTTAGGAGCAATAGTGAC 

AGTTAATGATGGCAAACCATTTGATGAAAATCCAACAGCACAGTCTTTGT 

TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTA 

GATGAGGATTTTTGTTACATGATTAAAAA.TCCAGGAATACCTTATAACAA 

TC CTATGGT CAAAAAAG CATTAGAAAAACAAATC CCT GTTTT GACTG AAG 

TGGAATTAG CATACTTAGTTTCAGAAT CTCAG CTAAT AGGTATTACAGG C 

TCTAACGGGAAAACGACAACGACAACGATGATTGCAGAAGTCTTAAATGC 

TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTG 

AAGTTGTT CAGG CTG CGGATGATAAAGAT ATT CTAGT TATGGAAT TAT CA 

AGTTTTCAG CTAAT GGGAGTTAAGG AATTT CGTC CTCATATTGCAGTAAT 

TACT AATTT AATGCCAACTCATTTAGATTATCATGG^ 

ATGTTG CTG CAAAATGGAATAT CCAAAAT CAAATGT CTT CAT CTGATTTT 

TTGGTACTTA^TTTTAATCAAGGTATTTCTAAAGAGTTAGcTAAAACTAC 

TAAAGCAACAATCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTT 

ACGTACAAG ACAAG CAACTTTT CTATAAAGGGGAG AATATTATGTTAGTA 

GATGACATTGGTGT CCGAGGAAG CCATAACGTAG AGAATG CT CT AGCAAC 

TATTG CGGTTGCTAAACTAG CT GGTAT CAGTAATCAAGTTATT AGAGAAA 

CTTTAAGCAATTTTGGAGGTGTTAAACACCGCTTGCAATCACTCGGTAAG 

GTTCATGGTATTAGTTT CTATAACGACAG CAAGT CAACTAATATATTGG C 

AACTCAAAAAGCATTATCTGGCTTTGATAATACT 

CAGGAGGT CTTGAT CG CGGTAATGAGTTTGATGAATTGATAC CAGAT AT C 
ACTGGACITAAACAT ATGGTTGTTTTAGGGGAAT CGGCAT CT CGAGT AAA 
ACGTGCTGCACAAAAAGC^GGAGTAACITATAG 

GAGATGCGGTACATAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATC 
TTGCTAAGT CCTG CAAATG CAT CATG^GACATGTATAAGAATTTCGAAGT 
CCGTGGTGATGAATTCATTGATAC t TTCG AAAGT CTTAGAGGAGAG 

SEQ ID NO. 6702 
STRAIN A909 

CAATAAGAACATTTGAAAATAAAAAAGT^ 

TCTGG AGAAGCTGCT G CACGTTTGTTAG CTAAGTT AGGAG CAATAGTGAC 
AGTTAATGATGGCAAACCATTTGATGAAAATCCAACAGCACAGTC 
TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGT CAT C CTTTAGAATTGTTA 
GATGAGGATTTTTGTTACATGATTAAAAATCCAGGAATACC^ 
TC CTATGGT CAAAAAAG CATTAGAAAAACAAATCCCTGTTTTGACTGAAG 
TGGAAT/TAGCATACTTAGTTTCAGAATCrCAGCTAATAGGTATTAC^ 
TCTAACX3GGAAAACGACAACGACAACGATGATTG CAGAAGTCTTAAATG C 
TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGG CTTT CCra 
AAGTTGTT CAGG CTGCGAATGATAAAGATACT CT AGTTATGGAATTATCA 
AGTTTTCAGCTAATGGCAGTTAAGGAATTTCGTCCTCATATTGCAGTAAT 
TACTAATTTAAT GCCAACT CATTTAGATT ATCATGGGTCTTTTG AAGATT 
ATGTTGCTG CAAAAT GGAATAT CCAAAAT CAAATGT CTTCATCTGATTTT 
TTGGTACITAATTTTAATCAAGGT ATTT CTAAAGAGTTAG CTAAAACTAC 
TAAAG CaACAATCGTTC CTTT CT CTACTACGGAAAAAGTT GATGGTG CTT 
ACGTACAAGACAAG CAACTTTT CTATAAAGGGGAGAATATTATGT CAGTA 
GATGACATTGGTGTCCCAGGAAGCCATAACGTAnAGAATGCT CTAG CAAC 
TATTGCGGTTGCTAAACTGCCTGGTATCAGTAATCAAGTTATTAgAGAA^ 
CTTTAAGCAATTTTGGAGGtGTTAAACAC'CGC^ 

GTTCATGGTATTAGTTT CTATAACGACAGCAAGT CAACTAATATATTGG C 
AACTCAAAAAGCATTAT CTGG CTTTGATAATACTAAAGTTAT C CTAATTG 
CAGGAC4GTCITG AT CG CGGTAATGAGTTTGATGAATTGAT ACCAGATATC 
ACTGGACTTAAACATATGGTTGTTTTAGGGGAAT CGG CATCT CGAGTAAA 
ACGTGCTGCACAAAAAGCAGGAGTAACTTATAGCGATGCTITTAGAT^ 
GAGATGCGGT ACAT AAAGCTTATG AGGTGG CACAACAGGG CG AT GTT AT C 
TTGCTAAGTCCTGCAAATG CAT CATGGGACATGTATAAGAATTT CGAAGT 
CCGTGGTGATGAATT CATTGAT ACTTTCGAAAGT CTTAGAGGAGAG 

SEQ ID NO. 6703 
STRAIN H36B 

GGACGAGT AATGAAAACAATAACAACATTT GAAAAT 

AAAAAAGTTTTAGTC CTTGGTTTAG CACGATCTGG AGAAG CTGCTG CACG 
TTTGTTAGCTAAGTT AGGAG CAAT AGTGACAGTTAATGATGG CAAACCAT 
TTGATGAAAAT C CAACAGCACAGT CTTTGTTGGAAGAGGGTATTAAAGTG 
GTTTGTGGTAGT CAT CCTTTAGAATTGTTAGATGAGGATTTTTGTTACAT 
GATTAAAAATCCAGGAATACCTTATAACAATCCTATGGTCAAAAAAGCAT 
TAGAAAAACAAATC CCTGTTTTGACTGAAGTGGAATTAG CATACTTAGTT 
TCAGAAT CTCAG CTAAT AGGTATTACAGG CT CTAACGGGAAAACGACAAC 
GACAACGATGATTGCAGAAGTCrTAAATGCTGGAGGTC^ 
TAGCTGGGAATATCGGCITTCCTGCTAGTGAAGTTGTTCAGGOT 
GATAAAGATACT CTAGTTATGGAATTATCAAGTTTTCAG CTAATGGGAGT 
TAAGG AATTT CGTC CTCATATTG CAGTAATTACT AATTT AATG CCAACT C 
ATTTAGATTATCATGGGTCTTTTGAAGATTATGTTGCT 
ATC CAAAAT CAAATGT CTT CAT CTG ATTTTTTGGTACTTAATTTTAATCA 
AGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG GAACAAT CGTTC CTT 
TCT CT ACTACGGAAAAAGTTGATGGTG CITACGTACAAGACAAG CAACTT 
TT CTATAAAGGGGAGAATATTATGT CAGT AGATGACATTGGTGT CCCAGG 
AAG C CATAACGTAGAGAATG CT CTAGCAACTATTG CGGTTG CTAAACTGG 
CTGGTAT CAGTAATCAAGTT ATTAGAGAAACTTTAAG CAATTTTGGAGGT 
GTTAAACACCGCTTGCAATCACTCGGTAAGGTTCATGGTATTAGTTTCTA 
TAACGACAG CAAG 



967 



WO 2004/018646 



PCT/US2003/026827 



Table 67: Comparative Sequences relating to SAG0475 



SEQ ID NO. 6704 
STRAIN 18RS21 

GGAC GAGTAATG AAAACAAT AACAACATTTG 

AAAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCTGGAGAAGCrGCT 

GCACGTTTGTTAGCTAAGTTAGGAGCMTAGTGACAGTTAATGATGGCAA 

AC CATTT GATGAAAATC CAACAGCACAGT CTTTGTTG GAAGAGGGTATTA 

AAGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAGGATTTTTGT 

TACATGATTAAAAAT CCAG G AATAC CTTAT AACAAT C CTATGGT CAAAAA 

AG CATTAGAAAAACAAAT C C CTGTTTT GACTGAAGTGGAATT AG CATACT 

TAGTTTCAGAATCTCAGCTAATAGGTATTACAGGCTCTAACGGGAAAACG 

ACAACGACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGAGAGG 

TTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTGAAGTTGTTCAGGCTG 

CGAATGATAAAGATACTCTAGTTATGGAATTATCAAGTTTTCAGCTAATG 

GGAGTTAAGGAATTTCGTCCTCATATTGCAGTAATTACTAATTTAATGCC 

AACT CATTTAGATTATCATGGGT CTTTTG AAGATT ATGTTG CTG CAAAAT 

GGAATAT C CAAAAT CAAATGTCTTC1ATCTGATTTTTTGGTACTTAATTTT 

AATCAAGGT ATTTCTAAAGAGTTAG CT AAAACTACTAAAG CAACAATCGT 

TCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTTACGTACAAGAC^^ 

AA CrriU 'CTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGGTGTC 

CCAGGAAGCCAT AACGT AGAGAATGCT CTAGCAACTATTG CGGTTGCTAA 

ACTGGCTGGTAT CAGTAAT G^GTTATTAGAG AAACTTTAAGCAATTTTG 

GAGGTGTTAAACACCGCTTGCAATCACTCGGTAAGGTTCATGGTATTAGT 

TTCTATAACGACAG CAAGT CAACTAATATATTGG CAACT CAAAAAGCATT 

ATCTGGCTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTTGATC 

GCGGTAATGAGTTTGATGAATTGATACCAGATATCACTGGACriTAAAC^ 

ATCX3TTGTTTTAGGGGAAT CGG CAT CT CGAGT AA7VACGTG CTG CACAAAA 

AG CAGGAGTAACITATAGCG ATGCTTTAGATGTTAGAGATG C GGTACAT A 

AAGCTTATGAGGTGGCACAACAGGGCGATGITATCIT 

AATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAATT 

CATTGATACTTTCGAAAGT CTTAGAGG AGAG 

SEQ ID NO. 6705 
STRAIN M732 

GGACGAGTAATGAAAACAAT AACAACATTT GAAA 
ATAAAAAAGTTTTAGTCCITGGTTTAGCACGATCTGGAGAA 
CGTTTGTTAG CTAAGTTAGG AG CAATAGTG ACAGTTAATGATGG CAAAC C 
ATTTGATGAAAATCCAACAG CACAGT CTTTGTTG GAAGAGGGTATTAAAG 
TGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAGGA 
ATGATTAAAAATCCAGGAATACCTTATAACAATCCTATGGTCAAAAAAGC 
ATTAGAAAAACAAATCC CTGTTTTGACTGAAGTGGAATTAG CATACTTAG 
TTT CAGAAT CTCAG CTAATAGGTATTACAGG CT CTAACGGGAAAACG ACA 
ACGACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGAGAG^ 
GTTAG CTGGGAATAT CGGCTTTC CTGCTAGTGAAGTTGTT CAGG CTG CGG 
aTGATAAAGATATTCTAGTTATGGAATTATCAAGTTTTCAGCTAATGGGA 
GTTAAGGAATTT CGT CCT CATATTG CAGTAATTACTAATTTAATG C CAAC 
TC^tTTAGATTATCATGGGTCITTTGAAGATTATGtTGCT 
ATATCCAAAATCAAATGTCTTCATCTGATTTTTTGG 

CAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG CAACAaT CGTTC C 
TTTCTCTACTACGGAAAAAGTTGATGGTG CTTACGTACAAGACAAG CAAC 
TTTTCTATAAAGGGGAG AAT ATTATGT CAGTAGATGACATTGGTGTCC CA 
GGAAGCCATAACGTAGAGAATG CTCTAG CAACTATTGCGGTTG CT 
AGCTGGTATCAGTAATCAAGTTATTAGAGAAACTTTAAGCAATT^ 
GTGTTAAACACCG CTTG CAATCACTCGGTAAGGTTCATGGTATTAGTTTC 
TATAACGACAG CAAGTCAACTAATATATTGG CAACTCAAAAAGCATTATC 
TGGCTTTGATAATACTAAAGTTATC CTAATTG CAGGAGGT CTTGATCGC G 
GTAATGAGTTTGATGAATTGATACCAGATATCACTC^m 
GTTGTTITAGC<?GAATCG^CATCTCCAGTAAAACGTGCTGCACAAAAAGC 
AGGAGTAACTTATAG CX1ATG CTTTAGATGTTAGAGATGCGGTACATAAAG 
CTTATGAGGTGG CACAACAGGG CGATGTTATCTTG CTAAGTC CTG CAAAT 
G CAT CATGGG ACATGTATAAGAATTTCGAAGT CCGTGGTGATGAATT CAT 
TGATACTTTCGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6706 
STRAIN COH1 

GGACGAGTAATGAAAACAATAACAACATTTGA 

AAATAAAAAAGTTTTAGTC CTTGGTTTAGCACGAT CTGGAGAAG CCG CTG 

CCATTTGATGAAAATCCAACAGCACAGTC^ 
AGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAG 

acatgattaaaaatccac<;aataccttataacaatcctatggtcaaaa 
gcattagaaaaacaaatcccixntttga 

agttt cagaat ctcag ctaataggtattacagg ct ctaacgggaaaacga 

CAACGACAACGATGATTGCAGAAGTCTTAAATGCTG^AC^TCAGAGAGGT 
TTGTTAG CTGGGAATAT CGG CTTT C CTG CT AGTGAAGTTGTT CAGGCTG C 
GGaTGATAAAGATATTCTAGTTATGGAATTATCAAGTTTTCAGCTAATGG 
GAGTTAAGGAATTT CGTCCTCATATTG CAGTAATTACTAATTTAATG CCA 
ACT CATTTAGATTATCATGGGT CTTTTGAAGATTATGTTGCTG CAAAATG 
GAATATCCAAAATCAAATGTCTTCATCTGATTTT^ 

ATCAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAGCAaCAATCGTT 

CCTITTCTCTACTACGGAAAAAGTTGATGGTGCTTACG 

ACTTTTCTAT AAAGGC^ AGAATATTATGTCAGTAGATGACATTGGTGT C C 

CAGGAAG CCATAACGTAGAGAATG CT CTAG CAACT ATTG CGGTTGCT AAA 
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CTAG CTGGTAT CAGTAAT CAAGTTATT AG AGAAACTTTAAG CAATTTTG G 
AGGTGTTAAACACCGCTTGCAATCACTCX3GTAAGGTTCATGGTATTAGTT 
TCTATAACGACAGCAAGTCAACTAATATATTGGCAACTCAAAAAGCATTA 
TCTGGCTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTTGATCG 
CGGTAATGAGTTTGATGAATTGATACCAGATATCACTGGACTTAAACATA 
TGGTTGTTTTAGGGGAATCGGCATCTCGAGTAAAACGTGCTGCACAAAAA 
GCAGGAGTAACTTATAGCGATGCTTTAGATGTTAGAGATGCGGTACATAA 
AGCTTATGAGGTGGCACAACAGGGCGATGTTATCTTGCTAAGTCCTGCAA 
ATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAATTC 
ATTGATACTTTCGAAA 

SEQ ID NO. 6707 
STRAIN M781 

GGACGAGTAATGAAAACAATAACAACATT 

TGAAAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCTGGAGAAGCCG 

CTG CACGTTTGTTAG CTAAGTTAGGAG CAATAGT GACAGTTAATGATGG C 

AAAC CATTTG ATGAAAATC CAACAG CACAGT CTTTGTTGGAAGAGGGTAT 

TAA^GTGGTTTGTGGTAGTCATCCrTTAGAATTGTTAGATGAGGATTTTT 

GTTACATGATTAAAAATC CAGGAATACCTTATAACAAT C CTATGGTCAAA 

AAAGCATTAGAAAAAaAAATCCCTGTTTTGACTGAAGTGGAATTAGCATA 

CTTAGTTTCAGAATCTCAGCTAATAGGTATTACAGGCTCTAACGGGAAAA 

CGACAACGACAACGATGATTGCAGAAGT CTTAAAT G CTG GAGGTCAGAG A 

GGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTGAAGTTGTTCAGGC 

TG CGG ATGATAAAGATATTCTAGTT ATGG AATTAT CAAGTTTT CAGCTAA 

TGGGAGTTAAGGAATTTCGTCCTCATATTGCAGTAATTACTAATTTAATG 

CCAACTCATTTAGATTATCATGGGTCU-lU J lGAAGAlT ATGTTG CTG CAAA 

ATGGAATATCCAAAATCyy^ATGTCTTCATCTGATTTTTTGGTA 

TT AAT CAAGGTATTT CTAA^GAGTTAG CTAAAACTACTAAAG CAaCAAT C 

GTTC CTTTCT CTACT ACGGAAAAAGTTGATGGTGCTTACGTACAAGACAA 

GCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGGTG 

T C CCAGGAAG CCATAACGT AGAGAATG CT CTAGCAACTATTG CGGTTGCT 

AAACTAG CTGGTAT CAGTAAT CAAGTTATTAGAGAAACTTTAAG CAATTT 

TGGAGGTGTTAA^CACCG CTTG CAAT CACT CGGTAAGGTT CATGGTATTA 

GTTTCTATAACGACAGCAAGTCAACTAATATATTGGC^ 

TTATCTGGCITTGATAATACTAAAGTTAT C CTAATTG CAGGAGGTCTTGA 

TCG CG GT AATGAGTTTGATG AATTGATAC CAGATATCACTGG ACTTAAAC 

ATATGGTTGTTTTAgGGGAATCGGCATCTCGAGTAAAACGTGCTGCACAA 

AAAG CAGGAGT aACTT ATAG CGATG CTTTAGATGTTAGAGATG CGGTACA 

TAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATCT^ 

CAAATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAA 

TT CATTGATACTTT CGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6708 
STRAIN CJB110 

GGACGAGTAATGAAAACAATAACAACATTTGA 
AAATAAAAAAGTTTTAGTCCIT'GGTTTAGCACGATCT 
CACGTTTGTTAG CTAAGTTAGGAG CAATAGTGACAGTTAATGATGG CAAA 
C CATTTGATGAAAAT CCAACAG CACAGTCTTTGTTGG AAGAGGGTATTAA 
AGTGGTTTGTGGTAGTCAT C CTTTAGAATTGTTAGATGAGGATTTTTGTT 
ACATGATTAAAAATCCAGGA^TACCTTATAACAATCCTATGGTCAAAAA^ 
GCATTAGAAAAA.CAAATCCCrGTTTTGACrrGAAGTGGAATT 
AGTTT CAGAAT CTCAGCTAATAGGTATTACAGGCT CTAACGGGAAAACGA 
CAACGACAACGATG ATTG CAGAAGT CTTAAATGCTGGAGGTCAGAGAGGT 
TTGTTAG CTGGGAATAT CGGUTTT CCTG CIAGTGAAGTTGTT CAGG CTGC 
GGATGATAAAGATATTCTAGTTATGGAATTATCAAGTTTTCAGCTAATGG 
GAGTTA^GGAATTT CGT CCT CATATTG CAGTAATTACTAATTTAATGCCA 
ACTCATTTAGATTAT CATGGGTCTTTTGAAGAATATGTTG CTGCAAAATG 
GAATATC CAAAATCAAATGT CTTCATCTGATTTTT^ 

AT CAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG CAACAAT CGTT 
CCTTTCTCTACTACGGAAAAAGTTGATGGTGCTTACGTACAAGACAAGC^ 
ACTTTTCTATAAAGGGGAGAATATTATGTTAGTAGATGACATTGGTGTCC 
CAGGAAG CCATAACGTAGAGA^TG CTCTAG CAACTATTG CGGTTG CT AAA 
CTAG CTGGTATCAGT AAT CAAGTTATTAG AGAAACTTTAAGCAATTTTGG 
AGGTGTTAAACACCGCTTGCAATCACTCGGTAAGGTTCATGGTATTAGTT 
TCTATAATGACAGCAAGTCAACTAATATATTGGa^ACTCAAAAAGCA 
TCIGG CTTTGATAAT ACTAAAGTT AT C CTAATTG CAGGAGGT CTTGATCG 
CGGTAATGAGTTTGATGAATTGATACCAGATATCACTGGACT^ 
TGGTTGTTTT AGGGGAATCGG CATCTCGAGTAAAACGTG CTG CACAAAAA 
GCAGGAGTAACTTATAG CGATG CTTTAGATGTTAGAGATG CGGTACATAA 
AG CTTATGAGGTGG CACAACAGGGCGATGTTAT CTTG CTAAGTCCTGCAA 
ATGCATCATGGGACATGTATAAGAATTT CGAAGT C CGTGGTGATGAATTC 
ATTGATACTTTCGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6709 
STRAIN 1169NT 

CAATAACAACATTTGAAAATAAAAAAGTTTTAGT^ CACGA 
TCIGGAGAAG CCG CTGCACGTTTGTTAGCTAAGTTAGGAG CAATAGTGAC 
AGTTAATGATGGCAAACCATTTGATGAAAATCCAAC^ 
TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAG 
GATGAGGATTTTTGTTACATGATTAAAAAT CCAGGAATACCTTAT AACAA 
TCCTATGGTCAAAAAAG CATTAGAAAAACAAATCCCTGTTTTGACTGAAG 
TGGAATTAG CATACTTAGTTT CAGAATCT CAG CTAATAGGTATTACAGG C 
TCTAACGGGAAAACGACAACGACAACGATGATTGCAGAAGTCTTGAATGC 
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TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTG 
AAGTTGTTCAGGCTGCGGATGATAAAGATACTCTAGTTATGGAATTATCA 
AGTTTTCAGCTAATGGGAGTTAAGGAATTTCGTCCTCATATTGCAGTAAT 
TACT AATTT AATG CCAA.CT CATTT AGATT ATCAT G GGTCTTTTGAAGA t T 
ATG t TGCTG CAAAATGG AATAT CCAAAAT CAAATGT CTTCAT CTG ATTTT 
TTGGTACIT AATTTTAATCAAGGTATTTCTAAAG AGTTAG cTAAAACTAC 
TAAAGCAACAATCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTT 
ACGTACAAGACAAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTA 
GACGACATTGGTGTC C CAGG AAG C CATAACGTAGAGAATG CT CT AGCAAC 
TATTGaSGTTGCTAAACTAGCTGGTATCAGTAATCAAGTTATTAGAGAAA 
CTTT AAG CAATTTTGGAGGTGTTAAACAC CG CTTG CAAT CACT CGGTAAG 
GTTCATGGTATTAGTTT CT ATAACG ACAGT AAGT CAACTAATATATTGG C 
AACTCAAAAAGCATTAT CTGGCCTTGATAATACTAAAGTT AT CCT AATT G 
CAGGAGGTCTTG AT CG CGGTAATG AGTTTGATGAATTGAT AC CAGATATC 
ACTGX3ACTTAAGCATATGGTTGTTTTAGGGGAATCGGCATCTCGAGTAA 
ACGTGCTGCACAAAAAGCAGGAGTAACTTATAGCAATGCTTTAgATGTTA 
GAgATGCgGTACATAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATC 
TTGTTnAGTcCTGCGAATGCATCATGGGACATGTATAAGAATTTCGAAGT 
CCGTG GTGATGAATT CATTGATACTTT CG 

SEQ ID NO. 6710 

STRAIN JM9130013 

GGACGAGTAATGAAAACAATAACAACA 

TTTGAAAATAAAAAAGTTTTAGTCCTTGGTTTAG CACGATCTGGAGAAGC 
TG CTGCACGTTTGTTAGCTAAGTTAGGAG CAATAGTG ACAGTTAATGATG 
G CAAACCATTTGATG AAAAT C CAACAG CACAGTCTTTGTTGGAAGAGGGT 
ATTAAAGTGGTTTGTGGTAGT CAT CCITTAGAATTG t TAGATGAGGATTT 
TTGTTACATGATT aAAAAT CCAGGAAT AC CTTAT AACAAT CCTATGGT CA 
AAAAAGCATTAGAAAAACAAATCCCTGTTTTGACTGAAGTGGAA 
TACTTAGTTTCAGAATCTCAGCTAATAGGTATTACAGG 
AACGACAACGACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGA 
GAGGTTTGTTAG CTGGG AATAT CGG CTTTC CTGCTAGTGAAGTTGTT CAG 
GCTGCGAATGATAAAGATACTCTAGTTATGGAAT TAT CAAGTTTT CAGCT 
AATGGGAGTTAAGG AATTT CGT CCTCAT ATTG CAGTAATTACTAATTTAA 
TGCCAACTCATTTAGATTATCATGGGT CTTTTGAAGATTATGTTG CTGCA 
AAATGGAATATCCAAAATCAAATGTCTTCATCTGATT^ 
TITTAATCAAGGTATTT CTAAAGAGTTAG CTAAAACTACTAAAGCaACAA 
TCGTT CCTTTCT CTACT ACGGAAAAAGTTGATGGTG CITACGTAC^ 
AAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGG 
TGTCCCAGGAAGCCATAACGTAGAGAATGCTCTAGCAACTATTGCGGTTG 
CTAAACTGG CTGGTAT CAGTAATCAAGTTATTAG AGAAACTTTAAG CAAT 
TTTGGAGGTGTTAAACACCGCTTGCAATCACTCGGTAAGGTTCATGG 
TAG t TTCTATAACGACAG CAAGTC1AACTAATATATTGGCAACTCAAAAAG 
CATTATCTGG CTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTT 
GATCGCACTAATGAGTTTGATGAACTGATACCAGATATCACTGGACTTAA 
ACATATGGTTGTTTTAGGGGAATCGGCATCTCGAGTAAAACGTGCTGCAC 
AAAAAGCAGGAGTAACTTATAG CGATGCTTTAGATGTTAGAGATG CGGTA 
CATAAAGCTTATGAGGTGG CACAACAGGGCGATGTTATCTTG CT AAGTC C 
TG CAAATGCATCATGGGACATGTATAAGAATTT CGAAGT C CGTGGTGATG 
AATT CATTGATAC t TT CGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6710 
STRAIN 2603 

ggacgagtaatgaaaacaataacaacatttgaaaataaaaaagttttagt 
ccttggtttagcacgatctggagaagctgctgcacgtttgttagctaagt 
taggagcaatagtgacagttaatgatggcaaaccatttgatgaaaatcca 
acagcacagtctttgttggaagagggtattaaagtggtttgtggtagtca 
tcctttagaattgttagatgaggatttttgttacatgattaaaaatccag 
gaataccttataacaatcctatggtcaaaaaagcattagaaaaacaaatc 
cctgttttgactgaagtggaattagcatacttagtttcagaatctcagct 
aataggtattacaggctctaacgggaaaacgacaacgacaacgatgattg 
cagaagtcttaaatgctggaggtcagagaggtttgttagctgggaatatc 
ggctttcctgctagtgaagttgttcaggctgcgaatgataaagatactct 
agttatggaattatcaagttttcagctaatgggagttaaggaatttcgtc 
ctcatattgcagtaattactaatttaatgccaactcatttagattatcat 
gggtcttttgaagattatgttgctgcaaaatggaatatccaaaatcaaat 
gtcttcatctgattttttggtacttaattttaatcaaggtatttctaaag 
agttagctaaaactactaaagcaacaatcgttcctttctctactacggaa 
aaagttgatggtgcttacgtacaagacaagcaacttttctataaagggga 
gaatattatgtcagtagatgacattggtgtcccaggaagccataacgtag 
agaatgctctagcaactattgcggttgctaaactggctggtatcagtaat 
caagttattagagaaactttaagcaattttggaggtgttaaacaccgctt 
gcaatcactcggtaaggttcatggtattagtttctataacgacagcaagt 
caactaatatattggcaactcaaaaagcattatctggctttgataatact 
aaagttatcctaattgcaggaggtcttgatcgcggtaatgagtttgatga 
attgataccagatatcactggacttaaacatatggttgttttaggggaat 
cggcatctcgagtaaaacgtgctgcacaaaaagcaggagtaacttatagc 
gatgctttagatgttagagatgcggtacataaagcttatgaggtggcaca 
acagggcgatgttatcttgctaagtcctgcaaatgcatcatgggacatgt 
ataagaatttcgaagtccgtggtgatgaattcattgatactttcgaaagt 
cttagaggagag 
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MSA Alignment Results: Pretty output 

PRETTY of : /biotmp/msa30176 . 2 { * } April 29, 2002 02:09 

1 50 

msa3 0176.2{305_18RS2l} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176.2{305_26 03} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

tnsa30176 .2{305_A909} CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176.2{305_H36B} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa3 0176.2{305_JM9130013} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176.2{305_COHl} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176 .2{305_M78l} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176 .2{305e_M732) ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176.2{305__0 90} — CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176.2{305_CJB110} ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

msa30176 .2 {305__1169NT} ~ CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT 

Consensus - **** ********** ********** ********** 



msa30176 . 2 { 305_18RS21 } 
msa30176.2{305_2603) 
msa30176 . 2 {305_A909} 
msa30176 . 2 {305_H36B} 
msa30176.2{305_JM9130013} 
msa30176.2{305_COHl} 
msa3 0 176 . 2 { 3 05_M7 81 } 
msa30176.2{305e_M732} 
msa30176. 2 {305^090} 
msa3 0176 . 2 { 3 05_CJB110 } 
msa30176 . 2 { 305_1169NT} 
Consensus 



51 

CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 
CCTTGGTTTA 



GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 
GCACGATCTG 



GAGAAGCtGC 
GAGAAGCtGC 
GAGAAGCtGC 
GAGAAGCtGC 
GAGAAGCtGC 
GAGAAGCc GC 
GAGAAGCcGC 
GAGAAGCcGC 
GAGAAGCCGC 
GAGAAGCcGC 
GAGAAGCcGC 



TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 
TGCACGTTTG 



100 

TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 
TTAGCTAAGT 



********** ********** *******_** ********** ********** 



101 150 

msa30176 .2 {305_18RS2l} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa3017 6. 2 {305^2603} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176.2{305_A909} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176.2{305_H36B} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176.2{3 05_JM913 0013} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176.2{305_COHl} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176 .2{305_M78lj TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176.2{305e_M732) TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA, TGAAAATCCA 

msa30176.2{305_J)90} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176 . 2 { 305_CJB110 } TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

msa30176 .2 {305_11S9NT} TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA 

Consensus ********** ********** ********** ********** ********** 



151 200 

msa30176.2{305_18RS2l} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305_2603) ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305_A909} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305_H36B} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305_JM9130013 J ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa3 0176.2(30 5__C0H1 } ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{3 05_M78l) ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305e_M732} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa3 0176.2{305_090} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176.2{305_CJB110} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

msa30176-2{305_1169NT} ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA 

Consensus ********** ********** ********** ********** ********** 



msa30176.2{305_18RS2l} 
msa3 0 17 6 . 2 ( 3 0 5__2 6 03 } 
msa30176 . 2 {305_A909 } 
msa30176 . 2{305_H36B} 
msa30176 . 2 {305_JM9130013} 
msa30176 . 2 f 305_COH1 } 
nisa3 0176 . 2 (305_M781 } 
msa30176.2{305e_M732} 
msa30176.2{305_090} 
msa3 0176 . 2 { 3 05_CJB110 } 
msa30176.2{305_1169NT} 
Consensus 



201 

TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
TCCTTTAGAA 
********** 



TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
TTGTTAGATG 
********** 



AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
AGGATTTTTG 
********** 



TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
TTACATGATT 
********** 



250 

AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
AAAAATCCAG 
********** 



251 300 

msa30176.2{305_18RS2l} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176.2{305_2603} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176.2(305_A909} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa3 0176 . 2 {305__H36B} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176.2{305_JM9130013} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa3 0176 . 2 {305_COH1 } GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176.2{305_M78l} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176.2{305e_M732) GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 

msa30176. 2 {305_090} GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC 
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Table 67: Comparative Sequences relating to SAG0475 



msa30176 . 2 {305_CJB110 } 
msa30176.2{305_1169NT} 
Consensus 



GAATACCTTA TAACAATCCT ATGGT CAAAA AAGCATTAGA AAAACAAATC 
GAATACCTTA TAACAATCCT ATGGT CAAAA AAGCATTAGA AAAACAAATC 
********** ********** ********** ********** ********** 



msa30176.2{30S_18RS2l} 
msa30176 .2{305_2603} 
msa30176 . 2 {305_A909} 
msa3 0 17 6 . 2 { 3 0 5_H3 6B J 
msa30176.2{305_JM9130013} 
ms a3 0 17 6 . 2 { 3 05_COH1 } 
msa30176.2{305_M781* 
msa3 0 1 7 6 . 2 { 3 0 5e_M73 2 
msa30176.2{305_090 
msa3 017 6 . 2 { 3 05_C JB110 } 
msa30176.2{3 05_1169NT} 
Consensus 



301 

CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
CCTGTTTTGA 
********** 



CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
CTGAAGTGGA 
********** 



ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
ATTAGCATAC 
********** 



TTAGTTT CAG 
TTAGTTTCAG 
TTAGTTT CAG 
TTAGTTTCAG 
TTAGTTTCAG 
TTAGTTTCAG 
TTAGTTTCAG 
TTAGTTTCAG 
TTAGTTTCAG- 
TTAGTTTCAG 
TTAGTTTCAG 
********** 



350 

AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
AATCTCAGCT 
********** 



351 

msa30176.2{305_18RS2l} AATAGGTATT 

msa30176.2{305_2603} AATAGGTATT 

msa30176.2{3 05_A909} AATAGGTATT 

ms a3 0 17 6 . 2 { 3 05_H3 6B } AATAGGTATT 

msa30176.2{305_JM9130013} AATAGGTATT 

ms a3 0 17 6 . 2 { 3 05_COH1 } AATAGGTATT 

msa3 0 17 6 . 2 { 3 05_M7 8 1 } AATAGGTATT 

msa3 0 1 7 6 . 2 { 3 0 5e_M7 3 2 } AATAGGTATT 

msa3 0176. 2 {305^090} AATAGGTATT 

msa30176.2{305_CJB110} AATAGGTATT 

msa30176.2{305_1169NT} AATAGGTATT 

Consensus ********** 

401 

msa30176-2{3 05_18RS2l} CAGAAGTCTT 

msa3 017 6. 2 {305J2603} CAGAAGTCTT 

msa3 0176.2{305_A909) CAGAAGTCTT 

ms a3 0 17 6 . 2 { 3 05_H3 6B } CAGAAGTCTT 

msa30176.2{305_JM9130013} CAGAAGTCTT 

msa30176 . 2 (305_COH1 } CAGAAGTCTT 

msa3 0176.2{305_M78l} CAGAAGTCTT 

msa30176.2{3 05e_M732} CAGAAGTCTT 

msa3 0176.2{305_090} CAGAAGTCTT 

msa30176 . 2 {305_CJB110} CAGAAGTCTT 

msa3 0176 . 2 { 3 05_1169NT} CAGAAGTCTT 

Consensus ********** 



400 

ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG 
********** ********** ********** ********** 

450 

aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
gAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC 
_********* ********** ********** ********** 



msa30176 . 2 {305_18RS21 } 
msa30176.2{305_2603} 
msa30176.2(305_A909} 
msa30176.2{305_H36B} 
msa30176.2{305_JM9130013} 
msa30176.2{305_COHl} 
msa30176 . 2 {305_M781} 
msa30176 . 2 { 305e_M732 } 
msa30176 . 2 { 305_090 } 
msa3 0176 . 2 { 3 05_CJB110 } 
msa3 0176 . 2 { 3 05_1169NT} 
Consensus 



msa30176.2{305_18RS2ll 
msa30176 .2 {305_2603 } 
msa30176.2{305_A909} 
msa30176.2{305_H36B> 
msa30176 ,2{305_JM9130013 } 
msa3 0 17 6 . 2 { 3 05_COH1 } 
msa30176.2{305_M78l} 
msa30176.2{305e_M732} 
msa3 0176 . 2 { 305__090 } 
msa30176 . 2 f 305_CJB110 > 
msa30176.2{305_1169NT} 
Consensus 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa3 0 17 6 . 2 { 3 05_A9 09 \ 
msa30176.2{305_H3 6B} 
msa30176.2{305_JM9130013} 
msa30176.2{305_COHll 
msa30176 . 2 { 305_M781 \ 
msa30176.2{305e__M732) 



451 

GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
GGCTTTCCTG 
********** 

501 

AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 
AGTTATGGAA 



CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 
CTAGTGAAGT 



TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 
TGTTCAGGCT 



GCGaATGATA 
GCGaATGATA 
GCGaATGATA 
GCGaATGATA 
GCGaATGATA 
GCGgATGATA 
GCGgATGATA 
GCGgATGATA 
GCGgATGATA 
GCGgATGATA 
GCGgATGATA 



500 

AAGATAcTCT 
AAGATAcTCT 
AAGATAcTCT 
AAGATAcTCT 
AAGATAcTCT 
AAGATAtTCT 
AAGATAtTCT 
AAGATAtTCT 
AAGATAtTCT 
AAGATAtTCT 
AAGATAcTCT 



********** ********** ***_****** ******_*** 



TTATCAAGTT 
TT AT CAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 
TTATCAAGTT 



TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 
TTCAGCTAAT 



GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 
GGGAGTTAAG 



550 

GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 
GAATTTCGTC 



551 

CTCATATTGC 
CTCATATTGC 
CTCATATTGC 
CTCATATTGC 
CTCATATTGC 
CTCATATTGC 
CTCATATTGC 
CTCATATTGC 



AGTAATTACT 
AGTAATTACT 
AGTAATTACT 
AGTAATTACT 
AGTAATTACT 
AGTAATTACT 
AGTAATTACT 
AGTAATTACT 



AATTTAATGC 
AATTTAATGC 
AATTTAATGC 
AATTTAATGC 
AATTTAATGC 
AATTTAATGC 
AATTTAATGC 
AATTTAATGC 



CAACTCATTT 
CAACTCATTT 
CAACTCATTT 
CAACTCATTT 
CAACTCATTT 
CAACTCATTT 
CAACTCATTT 
CAACTCATTT 



600 

AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
AGATTATCAT 
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Table 67: Comparative Sequences relating to SAG0475 



msa30176.2{305_090> 
msa30176.2{305_CJB110} 
msa30176.2{305_1169NT} 
Consensus 



CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT 
CT CATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT 
CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT 
********** ********** ********** ********** ********** 



601 650 

msa30176.2{305_18RS2l} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

TTlBa30176.2{305_2603} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa3 0176.2(30 5_A9 0 9 } GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

lW3a30176.2{3 05_H36B} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305_JM9130013} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305_COHl} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305_M78l} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305e__M732} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176 .2 {305_090 } GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305_CJB110} GGGTCTTTTG AAGAaTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

msa30176.2{305_1169NT} GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT 

Consensus ********** ****_***** ********** ********** ********** 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa30176.2f305_A909) 
msa3 0176 . 2 { 3 05__H3 6B) 
msa30176.2{305_JM9130013} 
msa30176.2{305_COHl} 
msa30176.2{305_M78l] 
msa30176 .2{305e_M732] 
msa30176.2{305_090] 
msa30176.2{305_CJB110] 
msa30176 . 2 {305_1169NT] 
Consensus 



651 

GTCTT CAT CT 
GTCTT CAT CT 
GTCTTCATCT 
GTCTT CATCT 
GTCTTCATCT 
GTCTTCATCT 
GTCTTCATCT 
GTCTTCATCT 
GTCTTCATCT 
GTCTTCATCT 
GTCTTCATCT 
********** 



GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
GATTTTTTGG 
********** 



TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
TACTTAATTT 
********** 



TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
TAATCAAGGT 
********** 



700 

ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
ATTTCTAAAG 
********** 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa30176.2f305__A909} 
msa3 0 17 6 . 2 { 3 05_H3 6B } 
msa30176.2{305_OM9130013; 
msa3 0 17 6 . 2 { 3 0 5_C0H1 ] 
msa30176.2{30S_M78lj 
msa3 0 1 7 6 . 2 { 3 0 5e_M73 2 
msa30176.2{305_090j 
msa3 0 17 6 . 2 { 3 0 5_C JB1 10 } 
msa30176.2{305_1169NT} 
Consensus 



701 

AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
AGTTAGCTAA 
********** 



AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
AACTACTAAA 
********** 



GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
GCAACAATCG 
********** 



TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
TTCCTTTCTC 
********** 



750 

TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
TACTACGGAA 
********** 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa30176.2{305_A909} 
msa30176.2{305_H36B} 
msa30176.2{305_JM9130013} 
msa3 0 17 6 . 2 { 3 05_COH1 } 
msa30176.2{305_M781) 
msa30176 . 2 { 305e_M732 } 
msa30176.2{305_090} 
msa30176.2{305_CJB110j 
msa30176 . 2 {305_1169NT} 
Consensus 



msa30176.2{305_18RS2l} 
msa30176 . 2 {305__2603 } 
msa30176.2{305_A909} 
msa30176.2{305_H36B] 
tnsa30176 . 2 {305_JM9130013 j 
msa3 0 17 6 . 2 { 3 0 5_C0H1 ] 
msa30176.2{305_M78l] 
msa30176.2{305e_M732] 
msa30176.2{305J)90} 
msa30176 . 2 { 3 05_CJB110 } 
msa30176 . 2 {305__1169NT} 
Consensus 



751 

AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
AAAGTTGATG GTGCTTACGT ACAAGACAAG 
********** ********** ********** 

801 

GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAtG ACATTGGTGT 
GAATATTATG TtAGTAGAtG ACATTGGTGT 
GAATATTATG TtAGTAGAtG ACATTGGTGT 
GAATATTATG TcAGTAGAcG ACATTGGTGT 
********** *_******_* ********** 



CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
CAACTTTTCT 
********** 



CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
CCCAGGAAGC 
********** 



800 

ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
ATAAAGGGGA 
********** 

850 

CATAACGTAg 
CATAACGTAg 
CATAACGTAn 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
CATAACGTAg 
*********_ 



851 900 

msa30176.2{305_18RS2l} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT 

msa30176.2{305_2603} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT 

msa30176 .2 {305_A909 } AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT 

maa30176.2{305__H36B} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT 

msa30176.2{305_JM9130013} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT 

msa30176.2{305_COHl} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT 

msa30176.2{305_M78l} AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT 
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Table 67: Comparative Sequences relating to SAG0475 



msa30176.2{305e_M732} 
msa30176. 2 {305_090} 
msa30176 . 2 { 305_CJB110 } 
msa30176 . 2 {305_1169NT} 
Consensus 



AGAATGCTCT AG CAACTATT GCGGTTGCTA AACTaGCTGG TAT CAGTAAT 
AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT 
AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT 
AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT 
********** ********** ********** ****_***** ********** 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa30176.2{305_A909} 
msa3 0 17 6 . 2 { 3 05_H3 6B } 
msa30X76 . 2 {305_JM9130013 ) 
msa3 0 176 . 2 { 3 05_COH1 } 
msa30176 . 2 {305_M78l} 
msa30176 . 2 {305e_M732 } 
msa3 0176.2{305_090) 
msa30176.2{305_CJB110} 
msa30176.2{305_1169NT} 
Consensus 



901 

CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
CAAGTTATTA 
********** 



GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
GAGAAACTTT 
********** 



AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
AAGCAATTTT 
********** 



GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
GGAGGTGTTA 
********** 



950 

AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
AACACCGCTT 
********** 



951 1000 

msa30176.2{305_18RS2l} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{3 05_2603> GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305_A909} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305_H36B} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAG- 

msa30176 . 2 { 305_JM9130013 } GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305_COHl} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305_M78l} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305e_M732} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176 .2 {305_090} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt 

msa30176.2{305_CJB110} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAA t GACAGcAAGt 

msa30176.2{305_1169NT} GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGtAAGt 

Consensus ********** ********** ********** *********_ *****_«**_ 



msa30176.2{305_18RS2l) 
msa30176.2{305_2603}. 
msa30176 . 2 {305_A909 } 
. msa3 0176.2(30 5 JK3 6B } 
msa30176 . 2 {305_JM9130013 j 
msa30176.2{305_COHl] 
msa30176.2{305_M78lj 
msa30176 . 2 {305e_M732 ) 
msa30l76.2{305_090] 
msa3 017 6 .2(30 5_CJB110 ) 
msa30176.2{305_1169NT} 
Consensus 



1001 1050 

caactaatat attggcaact caaaaagcat tatctggctt tgataatact 

caactaatat attggcaact caaaaagcat tatctggctt tgataatact 

caactaatat attggcaact caaaaagcat tatctggctt tgataatact 



caactaatat 
caactaatat 
caactaatat 
caactaatat 
caactaatat 
caactaatat 
caactaatat 



attggcaact 
attggcaact 
attggcaact 
attggcaact 
attggcaact 
attggcaact 
attggcaact 



caaaaagcat 
caaaaagcat 
caaaaagcat 
caaaaagcat 
caaaaagcat 
caaaaagcat 
caaaaagcat 



tatctggctt 
tatctggctt 
tatctggctt 
tatctggctt 
tatctggctt 
tatctggctt 
tatctggctt 



tgataatact 
tgataatact 
tgataatact 
tgataatact 
tgataatact 
tgataatact 
tgataatact 



rasa30176.2{305_18RS2l] 
msa30176. 2 {305_2603 ] 
msa3 0 176 .2(3 05_A909 ) 
msa3 0176 .2(3 05_H36B } 
msa30176 . 2 {305_JM9130013 ] 
msa3 0 17 6 . 2 { 3 05_COH1 ) 
msa30176.2{305_M781 
msa30176.2{305e_M732] 
msa30176.2{305_090] 
msa30176 . 2 {3 05_CJB110 
msa30176.2{305_1169NT) 
Consensus 



1051 1100 

aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga 

aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga 

aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga 



aaagttatcc 
aaagttatcc 
aaagttatcc 
aaagttatcc 
aaagttatcc 
aaagttatcc 
aaagttatcc 



taattgcagg 
taattgcagg 
taattgcagg 
taattgcagg 
taattgcagg 
taattgcagg 
taattgcagg 



aggtcttgat 
aggtcttgat 
aggtcttgat 
aggtcttgat 
aggtcttgat 
aggtcttgat 
aggtcttgat 



cgcagtaatg 
cgcggtaatg 
cgcggtaatg 
cgcggtaatg 
cgcggtaatg 
cgcggtaatg 
cgcggtaatg 



agtttgatga 
agtttgatga 
agtttgatga 
agtttgatga 
agtttgatga 
agtttgatga 
agtttgatga 



msa30176.2{305JL8RS2l} 
msa30176 . 2 { 305_2603 } 
msa30176 . 2 {305_A909 } 
msa3 0 17 6 . 2 { 3 0 5_H3 6B J 
msa30176 . 2 {305_JM9130013 } 
msa3 0 176 . 2 { 3 05_COH1 
msa30176.2{305_M781] 
msa30176.2{305e_M732 
msa3 0176 . 2 { 305_090 J 
msa30176.2{305_CJB110] 
msa30176.2{305_1169NT} 
Consensus 



1101 1150 

attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat 

attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat 

attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat 



attgatacca 
attgatacca 
attgatacca 
attgatacca 
attgatacca 
attgatacca 
attgatacca 



gatatcactg 
gatatcactg 
gatatcactg 
gatatcactg 
gatatcactg 
gatatcactg 
gatatcactg 



gacttaaaca 
gacttaaaca 
gacttaaaca 
gacttaaaca 
gacttaaaca 
gacttaaaca 
gacttaagca 



tatggttgtt 
tatggttgtt 
tatggttgtt 
tatggttgtt 
tatggttgtt 
tatggttgtt 
tatggttgtt 



ttaggggaat 
ttaggggaat 
ttaggggaat 
ttaggggaat 
ttaggggaat 
ttaggggaat 
ttaggggaat 



rasa30176.2{305_18RS2l} 
msa30176 .2 f 305_2603 } 
msa30176.2{305__A909) 
msa30176.2{305_H36B} 
msa30176.2{305_JM9130013) 
msa30176.2{305_COHl} 



1151 1200 
cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc 
cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc 
cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc 

cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc 
cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc 
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msa30176.2{305_M78l) 
msa30176.2{305e_M732) 
msa30176. 2 { 305_090} 
msa30176.2{305_CJB110} 
msa30176.2{305_1169NT} 
Consensus 



cggcatctcg agtaaaacgt gctgcacaaa 
cggcatctcg agtaaaacgt gctgcacaaa 
cggcatctcg agtaaaacgt gctgcacaaa 
cggcatctcg agtaaaacgt gctgcacaaa 
cggcatctcg agtaaaacgt gctgcacaaa 



aagcaggagt aacttatagc 
aagcaggagt aacttatagc 
aagcaggagt aacttatagc 
aagcaggagt aacttatagc 
aagcaggagt aacttatagc 



msa30176-2{3 05_18RS2lJ 
msa30176.2{305_2603} 
msa30176.2{305_A909} 
msa3 0 17 6 . 2 { 3 0 5JH3 6B } 
msa3017 6.2{305_JM9130013} 
ms a3 0176.2(30 5_C0H1 } 
msa30176.2{305_M78l} 
msa30176.2{305e_M732) 
msa30176 . 2 { 305_090 } 
msa3 0176.2{305_CJB110} 
msa30176.2{305_1169NT} 
Consensus 



1201 1250 
gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca 
gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca 
gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca 



gatgctttag 
gatgctttag 
gatgctttag 
gatgctttag 
gatgctttag 
gatgctttag 
aatgctttag 



atgttagaga 
atgttagaga 
atgttagaga 
atgttagaga 
atgttagaga 
atgttagaga 
atgttagaga 



tgcggtacat 
tgcggtacat 
tgcggtacat 
tgcggtacat 
tgcggtacat 
tgcggtacat 
tgcggtacat 



aaagcttatg 
aaagcttatg 
aaagcttatg 
aaagcttatg 
aaagcttatg 
aaagcttatg 
aaagcttatg 



aggtggcaca 
aggtggcaca 
aggtggcaca 
aggtggcaca 
aggtggcaca 
aggtggcaca 
aggtggcaca 



msa30176.2{305_18RS2l} 
msa30176.2{305_2603} 
msa30176 . 2 f305_A909> 
msa30176 . 2 (305_H36B) 
msa30176.2{305_JM9130013} 
msa3 0 17 6 . 2 { 3 0 5_COHl } 
msa30176.2{305_M78l} 
msa30176.2{305e_M732}. 
msa30176.2{305_090> 
msa30176.2{305_CJB110) 
msa30176 . 2 { 305_1169NT} 
Consensus 



1251 1300 
acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt 
acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt 
acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt 



acagggcgat 
acagggcgat 
acagggcgat 
acagggcgat 
acagggcgat 
acagggcgat 
acagggcgat 



gttatcttgc 
gttatcttgc 
gttatcttgc 
gttatcttgc 
gttatcttgc 
gttatcttgc 
gttatcttgt 



taagtcctgc 
taagtcctgc 
taagtcctgc 
taagtcctgc 
taagtcctgc 
taagtcctgc 
tmagtcctgc 



aaatgcatca 
aaatgcatca 
aaatgcatca 
aaatgcatca 
aaatgcatca 
aaatgcatca 
gaatgcatca 



tgggacatgt 
tgggacatgt 
tgggacatgt 
tgggacatgt 
tgggacatgt 
tgggacatgt 
tgggacatgt 



msa30176 . 2 {305_18RS2l} 
msa30176.2{305_2603} 
msa30176.2{305_A909} 
msa30176.2{305_H36B} 
msa30176 . 2 (305_JM9130013 } 
rasa3 0 176 . 2 { 3 0 5_COHl \ 
msa30176.2[305J478l} 
msa30176 . 2 {305e_M732 } 
msa30176.2{305_090} 
msa30176.2{305_CJB110} 
msa30176.2{305_1169NT} 
Consensus 



1301 1350 
ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt 
ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt 
ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt 



ataagaattt 
ataagaattt 
ataagaattt 
ataagaattt 
ataagaattt 
ataagaattt 
ataagaattt 



cgaagtccgt 
cgaagtccgt 
cgaagtccgt 
cgaagtccgt 
cgaagtccgt 
cgaagtccgt 
cgaagtccgt 



ggtgatgaat 
ggtgatgaat 
ggtgatgaat 
ggtgatgaat 
ggtgatgaat 
ggtgatgaat 
ggtgatgaat 



tcattgatac 
tcattgatac 
tcattgatac 
tcattgatac 
tcattgatac 
tcattgatac 
tcattgatac 



tttcgaaagt 
tttcgaaa — 
tttcgaaagt 
tttcgaaagt 
tttcgaaagt 
tttcgaaagt 
tttcg 



1351 1362 

msa30176.2{305_18RS2l} cttagaggag ag 

msa30176.2{305_2603} cttagaggag ag 

msa30176.2{305_A909} cttagaggag ag 

msa30176 .2{305_H36Bj — 

msa30176.2{3 05__JM913 0013} cttagaggag ag 

msa30176.2{305_COHl} 

msa30176.2{305_M78l} cttagaggag ag 

msa30176.2{305e_M732) cttagaggag ag 

msa30176 .2{305__090) cttagaggag ag 

tnsa30176.2{305_CJB110} cttagaggag ag 

msa30176.2{305_1169NT} 

Consensus 



SEQ ID NO. 6711 

STRAIN 090 frame: 3 

ITTFFJSTKKVLVLGLARSGEAAARLLAKIjGAIVTV^ 

HPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLI GI TGSNGK 
TTTTTM I AE VLNAGGQRGLLAGN I GFPASEWQAADDKD I LVMELSS FQLMGVKE FRPH I 
AVI TNLM PTHLDYHGSFEDYYAAKWNI QNQMS S SDFLVLNFNQG I SKELAKTTKATI VP F 
STTEKVDGAYVQDKQLFYKGENI MLVDD I GVPGSHNVENALAT I AVAKLAG I SNQVI RET 
LSNFGGVKHRLQSLGKVHGI SFYNDSKSTNI LATQKALSGFDNTKVI LI AGGLDRGNEFD 
ELI PD I TGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVA 
ANAS WDM YKNFEVRGDE F I DTFESLRGE 



SEQ ID NO. 6712 

STRAIN A909 frame: 3 

ITTFENKKVLVI^IjARSGEAAARLLAKIiGAIVTV^ 

HPLELLDEDFCYMI KNPG I PYNNPMVKKALEKQI PVLTEVELAYLVSESQLI GITGSNGK 
TTTTTMI AEVLNAGGQRGLLAGNI GFPASEWQAANDKDTLVMELSS FQLMGVKE FRPH I 
AVITNIJ^PTHI^YHGSFEDYVAAK^IQNQMSSSDFLVIiNFNQGISKEIiAKTTKATIV^PF 
STTEKVDGAYVQDKQLF YKGEN I MS VDD I GVPGSHNVXNALAT I AVAKLAG I SNQVI RET 
LSNFGGVKHRLQSLGKVHGI SFYNDSKSTNI LATQKALSGFDNTKVI LI AGGLDRGNEFD 
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ELI PD ITGLKHMWLGESASRVKRAAQKAGVT YSDALDVRDAVHKAYEVAQQGDVI LLS P 
ANASWDMYKNFEVRGDEFI DTPESLRGE 

SEQ ID NO. 6713 

STRAIN H36B frame: 1 

GRVMKTITTFF^KKVLVLGLARSGFJIAARLLAKLGAIW 

KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGN I GF PASE WQAANDKDTLVMELS S FQXjMGVK 
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVIjNFNQGISKELAKTrK 
AT I VPFSTTEKVDGAYVQDKQLFYKGENI MSVDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHG I S FYNDSK 

SEQ ID NO. 6714 
STRAIN 18RS21 frame: 1 

GRVMKTITTFF^KKVLVLGLARSGEAAARLLAKLGAIVT^^ 

KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTTTTM I AE VLNAGGQRGLLAGN I GFPAS EWQAANDKDTLVMELS SFQLMGVK 
EFRPHIAVITNLMPTHLDYHGSFEDWAAKWNIQNQMSSSDFLVLNFNQGISKEL^^ 
AT I VPFSTTE KVDGAYVQDKQLFYKGENI MS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHG I SFYNDSKSTNIIATQKALSGFDNTKVI LI AGGLD 
RGNE FDEL I PD I TGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYE VAQQGD 
VILLSPANASWDMYKNFEVRGDEFIDTFESLRGE 

SEQ ID NO. 6715 
STRAIN M732 frame: 1 

GRVMKTITTFENKKVLVIXSLARSGEAAARLIAKLGAIVTVNDGK^ 

KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGN I GFPASEWQAADDKD I LVMELSS FQLMGVK 
EFRPHI AVI TNLMPTHLDYHGS FEDYVAAKWN I QNQMSS SDFLVLNFNQGI SKELAKTTK 
AT I VPFSTTEKVDGAYVQDKQLFYTCGENI MS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHG I S FYNDS KSTNI LATQKALSGFDNTKVIL I AGGLD 
RGNE FDEL I PD I TGL KHMVVLGE S AS RVKRAAQKAG VTY S DALD VRDAVH KA YE VAQQGD 
VILLSPANASWDMYKNFEVRGDEFIDTFESLRGE 

SEQ ID NO. 6716 
STRAIN COH1 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLLAK^ 

KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTITTMI AEVLNAGGQRGLLAGNI GFPASEVVQAADDKD I LVMELSS FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGS FEDYVAAKWN I QNQMS S SDFLVLNFNQGI SKELAKTTK 
AT I VPFSTTEKVDGAYVQDKQLFYKGENIMS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVIRETLSNFGGVKHRLQSLGKVHGI SFYNDSKSTNILATQKALSGFDNTKVIL IAGGLD 
RGNEFDEL I PD I TGLKHMVVLGES ASRVKRAAQKAGVTYSDAI^ VRDAVHKAYE VAQCX5D 
VI LLS PANASWDMYKNFEVRGDEF I DTFE 

SEQ ID NO. 6717 

STRAIN M781 frame: 1 

GRVMKTITTFENKKVLVIjGLARSGEAAARLLAKLGAIVTVNDGKPF 

KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGNI GF PASEWQAADDKD I LVMELS S FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGS FEDYVAAKWN I QNQMSS SDFLVLNFNQGI SKELAKTTK 
ATI VPFSTTEKVDGAYVQDKQLFYKGENI MS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHGI SFYNDSKSTNI LATQKALSGFDNTKVIL I AGGLD 
RGNEFDEL I PD I TGLKHMVVLGES ASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD 
VI LLS PANASWDMYKNFEVRGDEF I DTFESLRGE 

SEQ ID NO. 6718 

STRAIN CJB1 10 frame: 1 

GRVMKTITTFENKKVLVIXSLARSGEAAARLIAi^ 

KWCGSHPLELLDEDFC YM I KNPG I PYNNPMVKKALE KQ I PVLTE VELAYLVSE S QL I G I 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGN I GFPASEWQAADDKD I LVMELSS FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGSFEEYVAAKWNI QNQMSS SDFLVLNFNCC I SKEl^KTTK 
AT I VP FSTTEKVDGAYVQDKQL FYKGENIMLVDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVIRETLSNFGGVKHRLQSLGKVHG I SFYNDSKSTNI LATQKALSqFDNTKVILI AGGLD 
RG^ FDEL I PD ITGLKHMWLGESASRVKRAAQKAGVT^ 
VI LLS PANASWDMYKNFEVRGDEF I DTFE SLRGE 

SEQ ID NO. 6719 
STRAIN 1169NT frame: 3 

ITTFENKKVLVLGIjARSGEAAARLLAKLGAIVTVNDGKPFDENPTAQSLLEE 
HPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVS ESQL I GITGSNGK 
TTTTTM I AEVLNAGGQRGLLAGN IGF PAS E WQAADDKDT LVME L S S FQLMGVKE FRPH I 
AVITNLMPTHLDYHGS FED YVAAKWNI QNQMS S SDFLVLNFNQG I SKELAKTTKATI VPF 
STTE KVDGAYVQDKQLFYKGENI MS VDD I GVPGSHNVENALAT I AVAKLAG I SNQVI RET 
LSNFGGVKHRI^SLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLDRGNEFD 
EL I PD I TGL KHMWLGE S AS RVKRAAQKAGVT Y S NALD VRDAVH KAYEVAQQGD V I LXS P 
ANASWDMYKNFEVRGDEFIDTF 
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Table 67: Comparative Sequences relating to SAG0475 



SEQ ID NO. 6720 
STRAIN JM9130013 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLLAKLGAIVT^ 

KWCGSHPLELLDEDFC YM I KNPG I P YNNPMVKKALEKQI PVLTE VELAYLVSESQL IG I 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGN I GFPASEWQAANDKDTLVMELS S FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGS FED YVAAKWN I QNQMS S SDFLVLNFNQG I S KELAKTTK 
AT I VPFSTTEKVDGAYVQDKQLFYKGENIMSVDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHGI SFYNDSKSTNILATQKALSGFDNTKVILI AGGLD 
RSNEFDELI PDI TGLKHMVVI^ESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD 
VI LLS PANAS WDMYKNFEVRGDEF I DTFE S LRGE 

SEQ ID NO. 6721 

STRAIN 2603 frame: 1 

GRVM KT I TT FENKKVLVLG LARSGEAAARLLAKLGAI VTWDGKP FDEN PTAQS LLE EG I 
KWCGSHPLE LLDEDFCYMI KNPG I PYNNPMVKKALE KQ I PVLTE VELAYLVSESQL IGI 
TGSNGKTTTTTM I AEVLNAGGQRGLLAGN I GFPASEWQAANDKDTLVMELS S FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGS FED YVAAKVHJ I QNQMSSSDFLVLNFNQG IS KELAKTTK 
AT I VP FSTTEKVDGAYVQD KQLFYKGEN I MS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGG VKHRLQS LGKVHG I SFYNDSKSTNI LATQKALSGFDNTKVI LIAGGLD 
RGNE FDEL I PD I TGLKHMVVLGES ASRVKRAAQKAGVT YSDALD VRDAVHKAYE VAQQGD 
VI LL S PANAS WDMYKNFEVRGDEF I DTFE SLRGE 



MSA Alignment Results : Pretty output 

PRETTY of : /biotrnp/msa25243 . 2 { * } April 29, 2002 02:20 .. 

1 50 

msa25243.2{305_18RS2l} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243.2{305_2603} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 .2 (305_JM9 130013} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 .2{30 5_COHl} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 .2{305_M732} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 -2{3 05__M781 } grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 .2{305_1169NT} ITTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243.2{305„A909} ITTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243.2{305_CJB110} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243 .2{305_090} ITTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

msa25243.2{305_H36B} grvmktlTTF ENKKVLVLGL ARSGEAAARL LAKLGAI VTV NDGKPFDENP 

Consensus **** ********** ********** ********** ********** 



msa25243.2{ 

msa25243 . 
msa25243.2{305. 

msa25243 . 

msa25243 . 

msa25243 . 
msa25243.2{ 

msa25243 
msa25243 .2{ 
msa25243 

msa25243 



305_18RS2l} 
2{305_2603} 
_JM9130013} 
2{305_COH1} 
2{305_M732} 
2{305_M78l} 
305_1169NT} 
2{305_A909} 
305__CJB110} 
.2{305_090} 
2{305_H36B} 
Consensus 



51 

TAQSLLEEGI 
TAQSLLEEG I 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 



KWCGSH PLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 



********** ********** 



LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
LLDEDFCYMI 
********** 



KNPGIPYNNP 
KNPG I PYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 
KNPGIPYNNP 



100 

MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 



********** ********** 



msa25243 .2{305_18RS21 
msa25243.2{305_2603 
msa25243.2{305_JM9130013 
msa2 5243.2(30 5_COHl } 
msa25243 . 2 {305_M732 * 
msa25243.2{305_M781 
msa25243 .2{305_1169NT 
msa25243.2{305_A909 
msa2 52 4 3 . 2 { 3 0 5_C JB1 1 0 
msa25243.2{305_090 
msa25243.2{305_H36B} 
Consensus 



101 

PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
********** 



LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 



TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 
TGSNGKTTTT 



********** ********** 



TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
TMIAEVLNAG 
********** 



150 

GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
GQRGLLAGNI 
********** 



msa25243.2{ 
msa25243 
msa25243.2{305 
msa25243 
msa25243 
msa25243 . 

msa25243.2{ 
msa25243 . 

msa25243.2{ 
msa25243 
msa25243 



305__18RS2l} 
2{305_2603} 
_JM9130013} 
2(305^0^' 
2{305_M732" 
2{305_M781] 
305_1169NT] 
2{305_A909] 
305_CJB110' 
2{305_090; 
2{305_H36B] 
Consensus 



151 

GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
GFPASEWQA 
********** 



AnDKDtLVME 
AnDKDtLVME 
AnDKDtLVME 
AdDKDi LVME 
AdDKDiLVME 
AdDKDiLVME 
AdDKDtLVME 
AnDKDtLVME 
AdDKDiLVME 
AdDKDiLVME 
AnDKDtLVME 
*_***-**** 



LSS FQLMGVK 
LSS FQLMGVK 
LSS FQLMGVK 
LSS FQLMGVK 
LSSFQLMGVK 
LSS FQLMGVK 
LSSFQLMGVK 
LSSFQLMGVK 
LSSFQLMGVK 
LSSFQLMGVK 
LSSFQLMGVK 
********** 



EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
********** 



200 

NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
NLMPTHLDYH 
********** 
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Table 67: Comparative Sequences relating to SAG0475 



msa25243 ,2{ 

msa25243 . 
msa25243.2{305 t 

maa25243 . 

msa25243 . 

msa25243 . 
msa25243.2{ 

msa25243 
msa25243.2{ 
msa25243 

msa25243 



305_18RS21} 
2{305_2603} 
JM9130013} 
2{305_COH1} 
2{305_M732} 
2{305_M781} 
305_1169NT} 
2{305_A909} 
305_CJB110} 
2{305_090} 
2{305_H36B} 
Consensus 



201 

GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GS FEe YVAAK 
GSFEdYVAAK 
GSFEdYVAAK 



WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 



DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 



****_***** ********** ********** 



ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
ISKELAKTTK 
********** 



250 

ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
ATIVPFSTTE 
********** 



msa25243.2{305_18RS2l} 
msa25243 . 2 {305_2603 } 
msa25243 . 2 {305_JM913 0013 } 
msa25243 ,2{3 05_COHl} 
msa2 5243 .2(3 05_M732 } 
msa25243 .2{305_M78l} 
msa25243 .2{305_1169NT} 
rasa2 5243.2{305_A909} 
msa25243.2{305_CJB110} 
msa25243 .2 {305^090} 
msa25 24 3 . 2 { 3 05_H3 6B } 
Consensus 



251 

KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 



QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 



********** ********** 



sVDDIGVPGS 
SVDDIGVPGS 
SVDDIGVPGS 
SVDDIGVPGS 
sVDDIGVPGS 
SVDDIGVPGS 
SVDDIGVPGS 
SVDDIGVPGS 
1VDDIGVPGS 
1VDDIGVPGS 
sVDDIGVPGS 
_********* 



HNVeNALAT I 
HNVeNALAT I 
HNVeNALAT I 
HNVeNALATI 
HNVeNALAT I 
HNVeNALATI 
HNVeNALATI 
HNVxNALATI 
HNVeNALATI 
HNVeNALATI 
HNVeNALATI 
***_****** 



300 

AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
AVAKLAGISN 
********** 



msa25243.2{305_18RS2l} 
msa25243 . 2 { 305_2603 } 
msa25243.2{305_JM9130013} 
msa2 5243 . 2 { 3 05_COH1 } 
msa25243 .2{305_M732} 
msa25243 .2{305_M78l} 
msa2 52 4 3 . 2 { 3 0 5_1 1 6 9NT } 
msa25243.2{305_A909} 
msa25243 .2{305_CJB110} 
msa25243 . 2 { 305_090 } 
msa25243.2{305_H36B} 
Consensus 



301 

QVI RETLSNF 
QVIRETLSNF 
QVI RETLSNF 
QVI RETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 



GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 



********** ********** 



GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
GKVHGISFYN 
********** 



DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 
DSKstnilat 

DSK 

*** 



350 

qkalsgfdnt 
qkalsgf dnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 



msa25243 . 2 {305_18RS21 } 
msa25243 . 2 {305_2603 } 
msa25243 . 2 {305_JM9130013 } 
msa25243 .2{305_COHl} 
msa25243.2{305_M732) 
msa25243.2{305_M78l} 
msa25243.2{305_JL169NT} 
ins a2 5243 . 2 { 3 05_A9 0 9 } 
msa2 5243 . 2 {305_CJB110 } 
msa25243 .2{305_090} 
msa25243 .2{305_H36B} 
Consensus 



351 

kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 



rgnef delip 
rgnefdelip 
rsnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 



ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 



lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 



400 

aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 



msa25243 .2{ 

msa25243 
msa25243.2{305 

msa25243 . 

msa25243 . 

msa25243 . 
msa25243.2{ 

msa25243 
msa25243.2{ 
msa25243 

msa25243 



3 05_18RS21} 
2 {305^2603} 
_JM9130013} 
2{305_COH1} 
2{305_M732) 
2{305_M78l} 
305_1169NT} 
2{305_A909} 
3 05_CJB110) 
2{305_090} 
2{305_H36B} 
Consensus 



401 

daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
naldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 



kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 



vi lisp anas 
villspanas 
villspanas 
villspanas 
villspanas 
villspanas 
vilxspanas 
villspanas 
villspanas 
villspanas 



wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 



450 

gdef idtf es 
gdef idtf es 
gdef idtf es 
gdefidtfe- 
gdef idtf es 
gdef idtf es 
gdef idtf — 
gdef idtf es 
gdef idtf es 
gdef idtf es 



451 

msa25243.2{305_18RS2l} lrge 

msa25243.2{305_2603} lrge 

msa25243.2{305_JM9130013} lrge 

msa25243 .2{305_COH1} 

msa25243.2{305_M732} lrge 

msa25243.2{305_M78l} lrge 

msa25243.2{305_1169NT} 

msa25243.2{305_A909} lrge 

msa25243.2{305_CJB110} lrge 

msa25243. 2 {305^090} lrge 

msa25243.2{305_H36B) 

Consensus 
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WO 2004/018646 



PCT/US2003/026827 



Table 68: Comparative Sequences relating to SAG 0499 



SEQ ID NO. 6801 
STRAIN 2603 

ATGGCTAAAGAGAGGGTAGATGTTCTTGCCTATAAA.CAGGGACTTTTTGATACACGAGAG 
CAAGCGAAACGTGGTGTTATGGCAGGAATGGTGATTAACGTTATCAATGGAGAACGTTAT 
GATAAACCAGGTGAAAAGGTTGCAGACGATACrGAATTAAAACTAAAAGGTGAAAAACTA 
AAATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTTGAAA 
GTTGCAGATAAGCTAACTATAGATATTGGCGCCTCTACGGGTGGTTTTACTGATGTTATG 
CTACAAT CAGGAGCG CGTTT AGTTTACGCAGTAG ATGTAGGAACAAAT CAATTAGTTTG G 
AAGTTACGTCAGGAT CATCGTGTT CGTT CTATGG AACAAT ATAATTTT AGGTATG C C CAA 
AAAGAAGATTT CAAGGAGGGACTG CCTGA^TTTG CAT CG ATAGATGT CTCATTTAT CTCT 
CTTAATTTG ATTTTACCAG CT CTAAAAGAAATTTT AGTGGATGGTGG ACAAGTAGTG G CA 
TTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGGTAAAAATGGTAT^ 
GACAAGTTGGTT CATGAAAAGGTTTTG ACAACAGTGAC CAATTT CACGA^VAG ATT ATGG A 
TATACGGTTAAACATCITGATTTTTCGCCCATTCAAG 

TTAATG CAT TTG CAAAAGTGT CAAG AT C CACAAAAT CTT GTG CTT GAC CAAAT ACAAGAT 
GTTAT AGAAAAAGCACATAAGGAAT TTAAGAAAAATG AAG AAGAG 



SEQ XD NO. 6802 
STRAIN 090 

GCTAAAGAGAGGGTAGATGTTCTTGCCT 

ATAAACAGGGACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATG 

GCAG<3AATGGTGATTAACX3TTATCAATGGAGAACGTTATGATAAACCAGG 

TGAAAAGGTTGCAGACX3ATACTGAATTAAAACTAAAAGGTGAAAAACTAA 

AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCITTACAAGTT^ 

GAAATTT CAGTTG CAGATAAG CTAACTATAGATATTGG CGCCTCTACGGG 

TGGTTTTACTGATGTTATG CTACAATCAG GAG CGCGTTTAGTTTACG CAG 

TACATGTAGGAA(2AAATCAATTAGTTTG^AAGTTAC^TCAGGATCATCGT 

GTTCGTTCTATGGAACAATATAATTTTAGGTATGCC 

CAAGGAGGGACTGCCTGAATTTGCATCGATAGATGTCTCATT^ 

TTAATTTGATTTTACCAGCT CTAAAAGAAATTTT 

GTAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAAT^ 
TAAAAATGGTATTGT CAAAGACAAGTTGGTTCATGAAAAG GTTTTGACAA 
CAGTGAC CAATTTCA.CX?AAAC^TTATGGATATACGGTTAAACAT CTTGAT 
1TTTCGCCCATTCAAGGTGGAGATGGAAATATTGAGTTTTTAATGCATTT 
GO\AAAGTGTCAAGATC CACAAAAT CTTGTGCTTGACC^^ 
TTATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ XD NO. 6803 
STRAIN A909 

GCTAAAGAGAGGGTAGATGTT CTTG C CTA 

TAAACAGGGACITTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGG 
CAGGAATGGTGATTAACGTTAT CAATGGAGAACGTTATGATAAAC CAGGT 
GAAAAGGTTG CAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 
ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTT^ 
AAATTTCAGTTG CAGATAAG CTAACTATAGATATTGG CG C CTCTACGGGT 
GGTTTTACTGATGTTATG CTACAATCAGGAGCG CGTTTAGTTTACG CAGT 
AGATGTAGGAACAAAT CAATTAGTTTGGAAGTTACGTCAGGATCAT CGTG 
TTCXSTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGA^ 
AAGGAGGGACTG CCTGAATTTGCAT CGATAGATGT CTCATTTAT CT CTCT 
TAATTTGATTTTACCAGCTCTAAAAGAAATTTTAG 

TAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCX3TGAGCAAATTGGT 
AAAAATGGTATTGT CAAAGACAAGTTGGTTCATG AAAAGGTTTTG ACAAC 
AGTGACCT^TTTCACGAAAGATTATGGATATACGGTTAAACATC 
TTTCG CCCATTCAAGGTGGACATGGAAATATTGAGTTTTTAATG CATTTG 
CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGACC^ 
TATAGAAAAAGCACATAAGGAATTTAAGAAA^ATGAAGAAGAG 



SEQ XD NO. 6804 
STRAIN H36B 

GCTAAAGAGAGGGTAGATGTTCTTG CCTATAAACAGG 
GACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGGCAGGAATG 
GTGATTAACGTTAT CAATGGAGAACGTTATGATAAAC CAGGTGAAAAGGT 
TGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAAATATGTTA 
GTAGAGGTGGATTGAAATTAGAAAAAG CTTTACAAGTTTTTG AAATTTCA 
GTTG CAGATAAG CTAACTATAG ATATTGG CG C CT CTACGGGTGGTTTTAC 
TGATGTTATGCTACAATCAGGAGCGCGTTTAGTT^ 

GAACAAATCAATTAGTTTGGAAGTTACGT CAGGAT CATCGTGTTCGTT CT 

ATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTTCAAGGAGGG 

ACTG CCTGAATTTG CATCGATAGATGTCT CATTTATCT'CT CTTAATTTGA 

TTTTACCAGCTCTAAAAGAAATTTTAGTGGATGGTG 

TTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAAT^ 

TATTGTCAAACACAAGTTGGTTCATGAAAAGGTTTT^^ 

ATTTCACGAAAGATTATGGATATACGGTTAAACATCTTGATTTT^ 

ATTCAAGGTGGACATGGAAATATTGAGTTTTTAATGCATTTC CAAAAGTG 

TCAAGATCCACAAAATCITGTGCTTGACCAAATACAAGATG 

AAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 



SEQ XD NO. 6805 

STRAIN 18RS21 

GCTAAAGAGAGGGTAGATGTTCTTG CCTA 
TAAACAGGGACTTTTTGAT 
CAGGAATGGTGATTAACGTTAT CAATGGAGAACGTTATGATAAAC CAGGT 
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GAAAAGGTTG CAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 
ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAG CITTACAAGTTTTTG 
AAATTTCAGTTGCAGATAAGCTAACTATAGATATTGGCGCCTCTACGGGT 
GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACGCAGT 
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGT CAGGATCAT CGTG 
TT CGTTCTATGGAACAATATAATTTTAGGTATG CCCAAAAAGAAGATTT C 
AAGGAGG GACTGCCTGAATTTGCAT CG ATAGATGT CTCATTTAT CTCTCT 
TAATTTG ATTTT AC CAG CT CTAAAAGAAATTTTAGTGGATGGTGG ACA^G 
TAGTGGCATTAATTAAACCACAATTTGAAG CAGGT CGTG AG CAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTT CATGAAAAGGTTTT GACAAC 
AGTGACCAATTTCACGAAAGATTATGGATATACGGTrAAACATC^ 
TTTCGCCCATTCAAGGTGGACATGGAAATATTGAGTTTTTAATGC^TTTG 
CAAAAGTGT CAAGAT C CACAAAAT CTTGTG CTTGACCAAATACAAGATGT 
TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6806 
STRAIN M732 

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGATACACGAGAG CAAG CGAAACGTGGTGTTATGG 
CAGGACTGGTGATTAACGTTAT CAATGGAGAACGTTATGATAAACCAGG C 
GAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 
ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTTG 
AAATTTCAGTTGCAGATAAG CTAACTATAGATATTGG CG C CT CTACGGGT 
GG1TTU ACTGATGTTATGCTACAATCAGG AGCG CGTTTAGTTTACG CAGT 
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGT CAGGAT CAT CGTG 
TTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAG^ 
AAGGAGGGACTG CCTGAATTTG CATCGATAGATGT CTCATTTATCTCT CT 
TAATTTGATTTTAC CAG CTCTAAAAGAAATTTTAGTGGATGGTGGACAAG 
TAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTG^ 
AGTGACCAATTT CACGAAAG ATTATGGATATACGGTTAAACATCTTG ATT 
TTTCGCCCGTTCAAGGTGGACATGGAAATATTGAGTTTT^ 
CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGACCAAATACAAGATGT 
TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ XD NO. 6807 
STRAIN COH! 

GCTAAAGAGAGGGTAGATGTTCTTG CCT 

ATAAACAGG GACTTTTTGATACACGAGAGCAAG CGAAACGTGGTGTTATG 
GCAGGACTGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGG 
CXSAAAAGGTTGCAGACX^TACTGAATTAAA^ 

AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTT 
GAAATTT CAGTTGCAGATAAGCTAACTATAGATATTGG CG CCTCTACGGG 
TGGTTTT ACTGATGTTATG CTACAATCAGGAG CG CGTTTAGTTTACG CAG 
TAGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGT 
GTTCGTT CTATGGAACAATATAATTTTAGGTATG C CCAAAAAGAAGATTT 
CAAGGAGGGACTGC CTGAATTTG CATCGATAGATGTCTCATTTATCTCTC 
TTAATTTGATTTTACCAGCT CTAAAAGAAATTTTAGTGGATGGTGGACAA 
GTAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGG 
TAAAAATGGTATTGT CAAAGACAAGTTGGTT CR.TGAAAAGGTTTTGACAA 
CAGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAACAT CTTGAT 
TTTTCGCCCGTTCAAGGTGGACATGGAAATATTGAGTTTTTAA 
GCAAAAGTGTCAAGATCCACAAAATCITGTGCITGACCAAAT^ 
TTATAGAAAAAG CACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6808 
STRAIN M781 

G CTAAAGAGAGGGTAGATGTTCTTG CCT 

ATAAACAGGGACTTTTTGATACACGAG AG CAAG CGAAACGTG GTGTTATG 

GCAGGACTGGTGATTAACX5TTATCAATGGAGAACGTTATGATAAACCAGG 

CGAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAA 

AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCITTACAAGTTT^ 

GAAATTT CAGTTGCAGATAAGCTAACTATAGATATTGGCGCCTCTACGGG 

TG GTTTTACTGATGTTATGCTACAAT CAGGAGCGCGTTTAGTTTACG CAG 

TAGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTC^ 

GTTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGAT^ 

CAAGGAGGGACTGCCTGAATTTGCATCGATAGATGTCTCATTTATCTCT 

TTAATTTGATTTTACCAGCT CTAAAAGAAATTTTAGTG^ 

GTAGTGGCATTAATTAAA.C CACAATTTGAAGCAGGTCGTGAG CAAATTGG 

TAAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTGACA^ 

CAGTGACCAATTTCACX5AAAGATTATGGATATACGGTTA 

TTTT CX3CCCGTTCAAGGTGGACATGGAAATATTGAGTTTTTAATG CATTT 

GCAAAAGTGT CAAG ATC CAGAAAATCTTGTGCTTGAC CAAATACAAGATG 

TTATAGAAAAAG CACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6809 
STRAIN CJB 110 

GCTAAAGAGAGGGTAGATGTTCTTG CCTA 

TAAACAGGGACTTTTTGAT ACACGAGAGCAAG CGAAACGT GGTGTTATGG 
CAGGAATGGTGATTA^CGTTATCAATGGAGAACGTTATGATAAACCAGGT 
GAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTG 
ATATGTTAGTAGAGGTGGATTGAAATTAGAAAA^G CTTTACAAGT TT TTG 
AAATTTCAGTTG CAGAT AAG CTAACTATAGATATTGG CGC CTCTACGGGT 



980 
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PCT/US2003/026827 



Table 68: Comparative Sequences relating to SAG 0499 



GGTTTTACTGATGTTATGCTAC^TCAGGAGCGCGTTTAGTTTACGCAGT 
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGTG 
TTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTTC 
AAGG AGGGACT G CCTGAATTTG CAT CGATAGATGTCT CATTTAT CT CT CT 
TAATTTGATTTTACCAGCTCTAAAAGAAATTTTAGTGGATGGTGGACAAG 
TAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTGACIAAC 
AGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAACATCTTGATT 
TTTCGCCCATTGAAGGTGGACATGGAAATATTGAGTTTTTAATGGATTTG 
CAAAAGTGT CAAGATCCACAAAATCITGTGCTTGACCAAATACAAGATGT 
TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6810 
STRAIN 1169NT 

G CTAAAGAG AGGGTAGATGTT C TTG CCTA 

TAAACAGGGACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGG 

CAGGACrGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGGC 

GAAAAGGTTGCAGACGATACrGAATTAAAACTAAAAGGTGAAAAACTAAA 

ATATGTTAGTAGAGGTGGATTGA^TTAGAAAAAGCTTTACAAGTTTTTG 

AAATTTCAGTTG CAGATAAGCTAACTATAGATATTGG CG C CTCTACGGGT 

GGTTTTACTGATGTT ATG CTACAAT CAGGAG CGCGTTTAGTTTACGCAGT 

AGATGTAGGAA'CAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGTG 

TTCGT^CTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTTC 

AAGGAGGGACTGCCTGAATTTG CAT CGATAGATGT CTCATTTATCTCTCT 

TAATTTGATTTTGCCAGCTCrAAAAGAAATTTTAGTGGATGGTG 

TAGTGG CATTAATTAAACCACAATTTGAAG CAGGTCGTGAGCAAATTGGT 

AAAAATGGTATTGTCAAAGACAAGTTGGTTC^TGAAAAGGTTTT 

AGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAAGATCTTGA 

TTTCGCCCATTCAAGGTGGACATGGAAATATTGAGTT^ 

CAAAAGTGTCAAGATCGACAAAATCTTGTGCTTGACCA 

TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID HO. 6811 
STRAIN JM9 130013 

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGATACA.CGAGAGCAAGCGAAACGTGGTGTTATGG 

CAGGAATGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGGT 

GAAAAGGTTGC^GACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 

ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTTG 

AAATTTCAGTTG CAGATAAGCTAACTATAGATATTGG CG C CT CTACGGGT 

GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACGCAGT 

AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGAT CAT CGT G 

TTCGTTCTATGGAACAATATAATTTTAGGTATG C CO^AAAAGAAGATTT C 

AAGGAGGGACTGCCTGAATTTG CAT CGATAGATGTCTCATTTATCTCTCT 

TAATTTGATTTTAC CAG CT CTAAAAGAAATTTTAGTGGATGGTGG ACAAG 

TAGTGGCATTAATTAAACCACAATTTGAAG CAGGTCGTGAGCAAATTGGT 

AAAAATGGTATTGTCAAAGACMGTTGGTTCATGAAAA 

AGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAACATCTTGATT 

TTTCGCCCATTCAAGGTGGACATGGAAATATTGAGTTT 

CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGAC CAAATACAAGATGT 

TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 



PRETTY of: /biotmp/msa236683 . 2 { * } May 14, 2003 02:57 



msa236683 
msa236683 -2{ 
msa236683 . 
msa236683 . 
msa236683 . 2{ 
msa236683 . 
msa236683 .2(310 
msa236683 
msa236683 
msa236683 
msa236683.2{ 



2{310_090> 
310_18RS21} 
2{310__2603} 
2{310_A909} 
310_CJB110) 
2{310_H36B} 
_JM9130013} 
2{310_COH1) 
2(310_M732} 
2{310_M78l} 
310_1169NT} 
Consensus 



GCTAAAG 

GCTAAAG 

atgGCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

********** 



AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 
AGAGGGTAGA 



TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 
TGTTCTTGCC 



TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 
TATAAACAGG 



50 

GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 
GA CTTTTT GA 
GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 
GACTTTTTGA 



********** ********** ********** ********** 



51 

msa236683 .2{310_090} TACACGAGAG 

msa236683 .2{310_18RS2lJ TACACGAGAG 

tnsa2 3 66 83 .2(310_2603} TACACGAGAG 

rasa236683.2{310_A909} TACACGAGAG 

msa236683 .2 {310_CJB110} TACACGAGAG 

msa2 3 66 8 3 . 2 { 3 1 0_H3 6B } TACACGAGAG 

msa236683 .2{310_JM9130013) TACACGAGAG 

msa236683 . 2 f 310_COH1 } TACACGAGAG 

msa236683.2{310_M732} TACACGAGAG 

msa236683 .2{310_M781} TACACGAGAG 

msa236683 .2{310_1169NT} TACACGAGAG 

Consensus ********** 



100 

CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAaTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAcTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAcTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAcTG GTGATTAACG 
CAAGCGAAAC GTGGTGTTAT GGCAGGAcTG GTGATTAACG 
********** ********** *******_** ********** 
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msa23S683 
msa236683 ,2{ 

msa236683 . 

msa236683 
msa236683 .2{ 

msa236683 
msa236683 .2(310 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



.2{310_090} 
310_18RS2l} 
2{310_2603} 
2{310_A909) 
310_CJB110} 
2{310_H36B} 
_JM9130013} 
2{310_COHli 
2{310_M732} 
2{310_M781} 
310_1169NT} 
Consensus 



101 

TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
TTATCAATGG 
********** 



AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
AGAACGTTAT 
********** 



GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
GATAAACCAG 
********** 



GtGAAAAGGT 
GtGAAAAGGT 
GtGAAAAGGT 
GtGAAAAGGT 
GtGAAAAGGT 
GtGAAAAGGT 
GtGAAAAGGT 
GcGAAAAGGT 
GcGAAAAGGT 
GcGAAAAGGT 
GcGAAAAGGT 
*_******** 



150 

TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
TGCAGACGAT 
********** 



151 200 

msa236683 .2{310_090) ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa236683 . 2 { 310_18RS2l} ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 36683.2(3 10_2 603} ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 36683. 2(31 0_A9 09} ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa236683.2(310__CJB110> ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa236683.2(31 0_H3 6B } ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 3 66 8 3 . 2(310_JM9130013 } ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

rasa2 36683. 2(31 0__COH1 } ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 36683. 2(31 0 JM73 2 } ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 36683 .2{310_M78l} ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

msa2 36683. 2(31 0_116 9 NT } ACTGAATTAA AACTAAAAGG TGAAAAACTA AAATATGTTA GTAGAGGTGG 

Consensus ********** ********** ********** ********** ********** 



msa236683 .2(310_090} 
msa236683.2(310_18RS2l} 
msa236683 . 2 { 310_2603 } 
msa236683 .2(310_A909} 
msa236683 .2{310_CJB110} 
msa236683 .2{310_H36B) 
msa236683 . 2 (310_JM9130013 } 
msa236683 .2{310_COHl} 
msa236683 .2{310_M732} 
msa236683 .2{310_M78l} 
msa236683 .2{310_1169NT} 
Consensus 



201 

ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 
ATTGAAATTA 



GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 
GAAAAAGCTT 



********** ********** 



TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
TACAAGTTTT 
********** 



TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
TGAAATTTCA 
********** 



250 

GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
GTTGCAGATA 
********** 



msa23668 
msa236683 .2 
msa236683 
msa236683 
msa236683 .2 
msa236683 
msa23 6683 .2(31 
msa236683 
msa236683 
msa236683 
msa236683.2 



3 .2(310_090} 
(310_18RS21} 
.2 {310_2603} 
.2(310_A909) 
(310_CJB110) 
.2{310_H36B} 
0_JM9130013} 
.2{310_COH1} 
.2{310^M732} 
.2{310__M781} 
(310_1169NT} 
Consensus 



251 

AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
********** 



AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC 
AGATATTGGC. 
AGATATTGGC 
********** 



GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
********** 



GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
********** 



300 

TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
********** 



msa236683 
msa236683 ,2{ 

msa236683 . 

msa236683 . 
msa236683 .2( 

msa236683 . 
msa236683.2{310, 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



.2{310_090} 
310_18RS21} 
2{310_2603} 
2(310_A90 9} 
310_CJB110} 
2{310_H36B} 
_JM9130013) 
2(310_COH1} 
2(310_M73 2) 
2(310_M781} 
310_1169NT} 
Consensus 



301 

CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
CTACAATCAG 
********** 



GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
GAGCGCGTTT 
********** 



AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
AGTTTACGCA 
********** 



GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
GTAGATGTAG 
********** 



350 

GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
GAACAAATCA 
********** 



351 

msa236683 .2(310_090) ATTAGTTTGG 

msa236683.2(310_18RS21) ATTAGTTTGG 

msa236683 . 2 { 310_2603 j ATTAGTTTGG 

msa236683 .2(310_A909} ATTAGTTTGG 

msa236683 . 2{310_CJB110} ATTAGTTTGG 

msa236683 .2(310_H36B} ATTAGTTTGG 

msa236683 .2(310_JM9130013) ATTAGTTTGG 

msa236683 .2(310_COH1) ATTAGTTTGG 

msa236683.2(310_M732} ATTAGTTTGG 

msa236683.2(310_M78l} ATTAGTTTGG 

msa236683.2(310_1169NT} ATTAGTTTGG 

Consensus ********** 



AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 
AAGTTACGTC 



AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 
AGGATCATCG 



TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 
TGTTCGTTCT 



400 

ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 
ATGGAACAAT 



********** ********** ********** ********** 
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msa236683 
msa236683 . 2{ 

msa236683 . 

msa236683 . 
msa236683 .2 { 

msa236683 . 
msa236683 .2(310, 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



2{310_090} 
310_18RS2l} 
2f310_2603} 
2{310_A909} 
310_OJB110} 
2{310_H36B} 
_JM9130013> 
2{310_COHl} 
2{310_M732} 
2{310_M78l} 
310_1169NT} 
Consensus 



401 

ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 



GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 



AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 



TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 



450 

ACTGC CTGAA 
ACTGCCTGAA 
ACTGC CTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 



********** ********** ********** ********** ********** 



msa236683 
msa236683.2{ 

msa236683 . 

msa236683 . 
msa236683 .2{ 

msa236683 . 
msa236683 .2(310 

tnsa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2( 



.2(310^090} 
310_18RS2l} 
2(310_2603} 
2(310_A909} 
310_CJB110} 
2{310_H36Bj 
JM9130013' 
2(310_COHi; 
2(310_M732' 
2{310_M781] 
310_1169NT] 
Consensus 



451 

TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
********** 



TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
********** 



ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT ' 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
ATTTATCTCT 
********** 



CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
********** 



500 

TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTgCCAGC 
****_***** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2{ 

msa236683 . 
msa236683. 2(310 

msa236683 . 

msa236683. 

msa236683 . 
tnsa236683 .2( 



2 {310^090} 
310_18RS2l} 
2(310_2603} 
2{310_A909} 
310_CJB110* 
2{310_H36B] 
_JM9130013 
2{310_COH1] 
2(310_M732' 
2(310_M78l' 
310_1169NT} 
Consensus 



501 

TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
TCTAAAAGAA 
********** 



ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
********** 



ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
********** 



AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
AGTAGTGGCA 
********** 



550 

TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
TTAATTAAAC 
********** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 . 2{ 

tnsa236683 . 
msa236683 .2(310 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2( 



.2(310_090} 
310_18RS2l} 
2(310_2603} 
2(310_A909} 
310__CJB110} 
2{310_H36B} 
_JM9130013} 
2{310_COH1} 
2(310_M732} 
2{310_M781} 
310_1169NT} 
Consensus 



551 

CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
CACAATTTGA 
********** 



AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 
AGCAGGTCGT 



GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 
GAGCAAATTG 



********** ********** 



GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
GTAAAAATGG 
********** 



600 

TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
TATTGTCAAA 
********** 



msa236683 
msa236683 .2{ 
msa236683 
msa236683 
msa236683 .2( 
msa236683 
msa236683 .2(310 
msa236683 . 
' msa236683. 

msa236683 . 
msa236683 .2{ 



2(310_090} 
310_18RS21} 
2(310_2603} 
2(310_A909} 
310_CJB110} 
2{310_H36B) 
_JM9130013} 
2{310_COH1} 
2(310_M732} 
2(310__M781} 
310_1169NT} 
Consensus 



601 

GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
GACAAGTTGG 
********** 



TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
TTCATGAAAA 
********** 



GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
GGTTTTGACA 
********** 



ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
ACAGTGACCA 
********** 



650 

ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
ATTTCACGAA 
********** 



msa23668 
msa236683 .2 
msa236683 
msa236683 
msa236683 .2 
msa236683 
msa236683.2{31 
msa236683 
msa236683 
msa236683 
msa236683 .2 



3 .2(310_090} 
(310_18RS21} 
.2{310_2603"} 
.2{310_A909} 
{310_CJB110} 
.2(310_H36B 
0_JM9130013] 
.2{310_COH1] 
.2{310_M732] 
.2{310_M78l] 
(310_1169NT) 



651 

AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 



TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 



AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 



TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 



700 

aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
gTT CAAGGTG 
gTTCAAGGTG 
gTTCAAGGTG 
aTTCAAGGTG 
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Consensus ********** ********** ********** ********** _********* 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2{ 

msa236683 . 
msa236683.2{310 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



.2{310_090} 
310_18RS2l} 
2{310_2603} 
2{310_A909} 
310_CJB110} 
2{310_H36B) 
_JM9 130013} 
2{310_COH1} 
2{310_M732} 
2{310_M781} 
310_1169NT} 
Consensus 



701 

GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
GACATGGAAA 
********** 



TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
TATTGAGTTT 
********** 



TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
TTAATGCATT 
********** 



TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
TGCAAAAGTG 
********** 



750 

TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
TCAAGATCCA 
********** 



msa236683 .2 {310^090} 
msa236683 ,2{310_18RS2l} 
msa236683. 2 { 310^2603} 
msa236683 .2(310_A909} 
msa236683 .2 {310_CJB110 } 
msa236683 .2{310_H36B} 
msa236683.2{310_jJM9130013 
rasa236683 .2{310_COH1] 
msa236683 . 2 { 310_M73 2 j 
rasa236683.2{310_M78l) 
msa236683 .2{310_1169NT} 
Consensus 



751 

CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
CAAAATCTTG 
********** 



TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
TGCTTGACCA 
********** 



AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
********** 



GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
********** 



800 

AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
AAGCACATAA 
********** 



801 825 

msa236683.2{310_090} GGAATTTAAG AAAAATGAAG AAGAG 

msa236683 .2{310_18RS2l} GGAATTTAAG AAAAATGAAG AAGAG 

msa236683 . 2 {310__2603 } GGAATTTAAG AAAAATGAAG AAGAG 

msa236683 .2{310_A909} GGAATTTAAG AAAAATGAAG AAGAG 

msa236683.2{310_C»JB110} GGAATTTAAG AAAAATGAAG AAGAG 

msa2 36683.2(31 0_H3 6B } GGAATTTAAG AAAAATGAAG AAGAG 

msa236683.2{310_JM9130013} GGAATTTAAG AAAAATGAAG AAGAG 

msa23 6683 .2(310_COH1 J GGAATTTAAG AAAAATGAAG AAGAG 

msa236683.2|310_M732} GGAATTTAAG AAAAATGAAG AAGAG 

msa236683 .2{310_M78l} GGAATTTAAG AAAAATGAAG AAGAG 

msa236683-2{310_1169NT} GGAATTTAAG AAAAATGAAG AAGAG 

Consensus ********** ********** ***** 



SEQ ZD NO. 6812 
STRAIN 2603 frame: 1 

MAKERVDV1JVYKCGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I SVADKLT ID I GASTGG FTD VML QSGARL VYAVDVGTNQL VWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS I DVS F I SLNLi I L PALKE I LVDGGQWAL 
IKPQFEAGREQI GKNGI VKDKL VHE KVLTTVTNFT KDYG YTVKHLDF S P I QGGHGN I EFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ZD NO. 6813 
STRAIN 090 frame: 1 

AKERVDVIAYKC^LFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I S VADKLTI DIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL 
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ZD NO. 6814 
STRAIN A909 frame: 1 

AKERVDVXAYKX^LFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I SVADKLT I D I GASTGG FTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS I DVS F I S LNL I LPALKE I LVDGGQWAL 
I KPQFEAGREQI GKNGI VKDKL VHEKVLTTVTNFT KDYG YTVKHLDFSP IQGGHGNI EFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ZD NO. 6815 

STRAIN 18RS21 frame: 1 

AKERVDVIAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTEL 
YVSRGGLKLEKALQVFE I SVADKLTI DIGASTGG FTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQ YNFRYAQKEDFKEGLPE FAS IDVSFI SLNLI LPALKE ILVDGGQWAL 
I KPQFEAGREQIGKNGI VKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ZD NO. 6816 
STRAIN M732 frame: 1 

AKERVDVLAYKQGLFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I SVADKLT I D I GASTGGFTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS IDVSF I SLNLI LPALKE I LVDGGQWAL 
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I KPQFEAGREQ I GKNG I VKDKLVHE KVLTT VTNFTKD YGYTVKHLD F S P VQGGHGN I EFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ID NO. 6817 
STRAIN COH1 frame: 1 

AKERVDVLAYKQGLFDTREQAKRG VMAGL V I NVI NGERYDKPGE KVADDTEL KLKGE KLK 
YVSRGGLKLEKALQVFE I S VADKLT I D I GASTGG FTD VMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS IDVSF I SLNL ILPALKE ILVDGGQWAL 
I KPQFEAGREQ I GKNGI VKDKLVHE KVLTT VTNFTKD YGYTVKHLDFS PVQGGHGN I E FL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ID NO. 6818 
STRAIN M781 frame: 1 

AKERVDVLAYKQGL FDTREQAKRGVMAGL VI NVI NGERYDKPGE KVADDTELKLKGE KLK 
YVSRGGLKLEKALQVFE I S VADKLT I D I GASTGG FTD VMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS IDVSF I SLNL I LPALKE ILVDGGQWAL 
I KPQFEAGREQI GKNGI VKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPVQGGHGNI EFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ZD NO. 6819 
STRAIN CJB 1 10 frame: 1 

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS I DVSF I SLNL I LPALKE ILVDGGQWAL 
I KPQFEAGREQ I GKNGI VKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSP I QGGHGNIEFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ID NO. 6820 
STRAIN 1169NT frame: 1 

AKERVDVLAYKC^LFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I S VADKLT I D I GASTGGFTDVMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFAS I DVSF I SLNL I LPALKE ILVDGGQWAL 
I KPQFEAGREQ I GKNG I VKDKLVHEKVLTTVTNFTKD YGYTVKHLDF S P I QGGHGN I EFL 
MHLQKCQDPQNLVLDQI QDVIEKAHKEFKKNEEE 

SEQ ID NO. 6821 
STRAIN JM9 130013 frame: 1 

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK 
YVSRGGLKLEKALQVFE I S VADKLT ID I GASTGG FTD VMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGL PEFAS IDVS F I SLNLI LPALKE I LVDGGQWAL 
I KPQFEAGREQI GKNG I VKDKLVHEKVLTT VTNFTKD YGYTVKHLDF S P I QGGHGN I EFL 
MHLQKCQDPQNLVLDQI QDVI EKAHKEFKKNEEE 

SEQ ID NO. 6822 
STRAIN H36B frame: 1 

AKE R VDVLA YKQGL FDTRE Q AKRG VMAGMV I NV I NGE R YD KPGE KVADDTELKL KGE KL K 
YVSRGGLKLEKALQVFE I S VADKLT I D I GASTGGFTD VMLQSGARLVYAVDVGTNQLVWK 
LRQDHRVRSMEQYNFRYAQKEDFKEGLPE FAS IDVSF I SLNL I LPALKE I LVDGGQWAL 
I KPQFEAGREQI GKNGI VKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSP IQGGHGNIEFL 
MHLQKCQDPQNLVLDQ I QDVI EKAHKE FKKNEEE 

PRETTY of : /biotmp/msa236800 . 2 { * } May 14, 2003 02:58 

' ' 1 50 

msa236800.2{310_090} -AKERVDVLA YKQGL FDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa2 36800.2(31 0_1 8 RS2 1 } -AKERVDVLA YKQGL FDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa236800.2{310__2603} mAKERVDVLA YKQGLFDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa2 36800.2(31 0__A9 0 9 } -AKERVDVLA YKQGLFDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa2 36800. 2(31 0_CJB11 0 } -AKERVDVLA YKQGLFDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa236800 .2{310_H3 6B} -AKERVDVLA YKQGLFDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa236800.2{310_JM9130013} -AKERVDVLA YKQGLFDTRE QAKRGVMAGm VINVINGERY DKPGEKVADD 

msa236800 .2{310_COHl} -AKERVDVLA YKQGLFDTRE QAKRGVMAG1 VINVINGERY DKPGEKVADD 

msa236800 .2(310_M732} -AKERVDVLA YKQGLFDTRE QAKRGVMAGl VINVINGERY DKPGEKVADD 

msa2368 00.2{310_M78l} -AKERVDVLA YKQGLFDTRE QAKRGVMAGl VINVINGERY DKPGEKVADD 

msa236800.2(310_1169NT} -AKERVDVLA YKQGLFDTRE QAKRGVMAGl VINVINGERY DKPGEKVADD 

Consensus ********** ********** *********_ ********** ********** 

51 100 

msa236800 .2 {310_090} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLT ID I G ASTGGFTDVM 

msa236800.2(310_18RS2l} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800.2(310_2603 TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800 .2{310_A909} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800 .2{310_CJB110} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800 .2{310_H36B} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa2 36800 .2 (310_JM9 130013} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800.2{310_COHl) TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800.2(310__M732} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800 .2{310_M781) TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

msa236800.2(310_1169NT} TELKLKGEKL KYVSRGGLKL EKALQVFEIS VADKLTIDIG ASTGGFTDVM 

Consensus ********** ********** ********** ********** ********** 

101 150 
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WO 2004/018646 



PCT/US2003/026827 



Table 68: Comparative Sequences relating to SAG 0499 



msa236800 
tnsa236800 -2{ 
msa236800 . 
msa236800 
msa236800.2{ 
rasa236800 . 
msa236800 .2(310 
msa236800 
msa236800 
msa236800 
msa236800 .2{ 



.2(310_090 
310_18RS21 
2{310_2603} 
2{310__A909} 
310_CJB110} 
2{310_H36B} 
_JM9130013} 
2{310_COH1} 
2? 310_M732} 
2{310_M781} 
310_1169NT} 
Consensus 



LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA^ 



VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 



KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 



MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 



KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 



********** ********** ********** ********** ********** 



msa236800 
msa236800 .2 { 

msa236800 . 

msa236800 . 
msa236800 .2 { 

msa236800 
0133236800.2(310, 

ntsa236800 . 

msa236800 . 

msa236800 . 
msa236800.2{ 



2{310_090) 
310_18RS21 
2(310_2603j 
2(310_A909) 
310_CJB110) 
2(310_H36B} 
_JM9130013] 
2(310_COH1] 
2(310_M732} 
2(310_M78l} 
310_1169NT) 
Consensus 



151 

FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
********** 



LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 



ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 
ILVDGGQWA 



LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 



200 

EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 



********** ********** ********** ********** 



msa23680 
msa236800 . 2 
msa236800 
msa236800 
msa236800 .2 
msa236800 
msa236800.2(31 
msa236800 
msa236800 
msa236800 
msa236800 .2 



0.2{310_090} 
{310_18RS2l} 
.2(310^2603} 
.2{310_A909) 
(310_CJB110) 
.2(310_H36B} 
0_JM9130013} 
.2{310_COH1) 
.2(310_M732) 
.2(310_M781} 
(310_1169NT) 
Consensus 



201 

DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
********** 



TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
********** 



YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
********** 



iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
vQGGHGNIEF 
vQGGHGNIEF 
vQGGHGNIEF 
iQGGHGNIEF 



250 

LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 



_********* ********** 



msa236800 
msa236800 . 2 { 

msa236800 . 

msa236800 . 
msa236800 .2{ 

msa236800 . 
msa236800. 2(310. 

msa236800 . 

msa236800 . 

msa236800 . 
msa236800.2( 



.2(310„090] 
310_18RS21] 
2(310_2603] 
2(310_A909] 
310_CJB110] 
2{310_H36B} 
JM9130013] 
2(310_COH1] 
2(310_M732 
2(310_M78l] 
310_1169NT} 
Consensus 



251 

QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 



VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 



275 
KNEEE 
KNEEE 
KNEEE 
KNEEE 
KNEEE 
KNEEE 



KNEEE 
KNEEE 
KNEEE 



********** ********** ***** 
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Table 69: Comparative Sequences relating to SAG0032 



SEQ ID NO. 6901 
STRAIN 2603 

ATGAATAAAAAGGTACTATTGACATCGACAATGGCAGCTTCGCTATTATCAGTCGCAAGT 
GTTGAAGCACAAGAAAGAGATACGACGTGGACAGCACGT^ 

G ATTTGGTAAAG CAAGACAATAAAT CATCAT ATACTGTGAAAT ATGGTG ATACACTAAG C 
GTTATTTCAGAAGCAATGTCAATTGATATGAATGTCTTAGCAAAAATAAATAACATTGCA 
GATAT CAAT CTT ATTTAT CCTGAG ACAACACT GACAGTAACTTACGAT CAGAAGAGT CAT 
ACTGCCACTTCAATGAAAATAGAAACACCAGCAACAAATGCT 
ACTGTGGATTTGAAAACCAATCAAGTTTCT 

ATTTCGGAAGGTATGACACCAGAAGCAGCAACAACGATTGTTTCGCCAATGAAGACATAT 
T CTT CTG CG C CAG CTTTGAAAT CAAAAGAAGTATTAG CACAAGAG CAAG CTGTT AGT CAA 
G CAG CAG CTAATGAACAGGTAT CAC CAG CT CCTGTGAAGT CGATTACTT CAGAAGTT CCA 
G CAG CTAAAGAG GAAGTTAAA.C CAACT CAG ACGT CAGT CAGT CAGT CAACAACAGTATCA 
CCAGCTTCTGTTGCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAACT 
GT AGCAGCC C CTAG AGTGG CAAGTGTT AAAGT AGT CACT C CTAAAGT AG AAACTGGTGCA 
TGAGGA3AGCATGTATCAGCTCCAGCAGTTCCTGTG^ 

AGTAAGCTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCACAAAAAGCT 

ACACCGGTAGCACAACCAGCTTGAACAACAAATGCAGT 

GGGCTCCAACCTCATGTTGCAGCrTATAAAGAAAAAGTAGC^ 

GAATT CAGTACATAC CGTG CGGGAGAT CCAGGTGATCAT GGTAAAGGTTTAG CAGTTGAC 
TTTATTGTAGGT ACTAAT CAAG CACTTGGTAATAAAGTT G O^CAGTACTCTACACAAAAT 
ATGG CAG C^LAATAACATTT CAT ATGTTAT CTGGCAACAAAA.GTTTTACTCAAATACAAAC 
AGTATTTATGGAC CTG CTAATACTTGGAATG CAATGCCAGAT CGTGGTGGCGTTACTG C C 
AAC CACTATGACCACGTT CACGTAT CATTTAACAAATAATATAAAAAAGGAAG CTATTTG 
GCTT CTTTTTTATATG CCTTGAATAGACTTT CAAGGTTCT 



SEQ 10 NO. 6902 
STRAIN 090 

TGAGACAACACTGACAGTAACTTACGATCAGAAG AGT CAT ACTG C CACTT 
CAATGAAAATAGAAACACCmGCAACAAATGCTGCTGGTCAAACAC 
ACTGTGGATTTGAAAAC CAAT CAAGTTTCTGTTG CAGACCAAAAAGTTT C 
TCT CAATACAATTT CGGAAGGTATGACAC CAGAAGCAG CAACAACGATTG 
TTTCG C CAATGAAGACATATTCTTCTG CG CCAG CTTTGAAATCAAAA.GAA 
GTATT AG CACAAGAG CAAG CTGTT AGT CAAG CAG CAGCTAATGAACAGGT 
ATCAACAGCT CCTGTGAAGT CGATTACTT CAG AAGTTC CAGCAG CTAAAG 
AGGAAGTTAAACCAACTCAGACGT CAGT CAGT CAGTCAACAACAGTATCA 
CCAGCTTCTGTTGCCGCrGAAACACCAGCTCGAGTAGCT'AAAG 
GGTAAGAACTGTAGCAG CC C CTAGAGTGGCAAGTGTTAAAGTAGTCACT C 
CTAAAGT AGAAACTGGTGCAT CAC CAG AG CATGTAT CAG CTCCAG CAGTT 
C CTGTGACTACGACTTCAACAG CTACAGAC^GTAAGTTACAAGCGACTGA 
AGTT AAGAG CGTT CCGGTAG CACAAAAAG CTC CAACAG CAACAC C GGTAG 
CACAACCAG CTT CAACAAGA^TGCAGTAG CTGCACAT CCTGAAAATGCA 
GC4GCTCCAACCTCATGTTGCAGCTTATAAAGAAAAAGTAG 
TGGAGTTAATGAA.TTCAGTACATACCGTGC^GGTGATCCAGGTGATCATG 
GTAAAGGTTTAGCAGTCGACnT^TATTGTAGGTAAA 

AATGAAGTTGCACAGTACTCTACACAAAATATGG CAG CAAATAACATTTC 
AT ATGTTAT C TGGCAACAAAAGTTTTACT CAAATACAAATAGTATTTATG 
GACCTGCTAATACTTGGAATGCAA.TGCCAGATCGTGGTGGCGTTACTGCC 
AACCATTATGACCATGTTCACGTATCATTTA^CAAATAATATAAAAA^GG 
AAGCTATTTGGC1T' CTTTTTTATATGCCTTGAAT AGACTTT CAAGGTTCT 
TATATAATTTTTATTA 



SEQ XD NO. 6903 
STRAIN A909 

CTGATTTGGTAAAG CAAGACAATAAATCAT CATATACTGTGAA 
ATATGGTGATACACTAAG CGTTATTTCAGAAG CAATGTCAATTGATATGA 
ATGTCTTAG CAAAAATTAATAACATTGCAGATATCAATCTTATTTATC cT 
C^GACAACACTG aCAGTAACTTACG ATCAGAAGAGTCATACTGCTACCT C 
AATGAAAATAGAAACACCAG CAACAAATG CTG CTGGT CAAACAaCAG cT A 
CTGTCGATTTGAAAACCAATCAAGTTTCTGTTGCAGACCAAAA^ 
CTCAATACAATTTCGGAAG GTATGACACCAGAAGCAG CAACAACGATTGT 
TTCGCCAATGAAGACATATTCTTcTGCGCCAGCIT^ 

TATTAg CACAAGGGC aAG CTGTTAGTCAAGCAGCAGCTAATGAACAGGTA 
TCAc CAG CT C CTGTG AAGTCGATTACIT CAGAAGTTC CAg CAG CTAAAGA 
C4GAAGTTAAACCAaCTCAgACGTCAgTCAGTCA 
CAgCTT?CTGTTGCCGCTGAAA.CACCAGCTCCAgTAGCr 
GTAAGAACTGTAGCAG C C CCTAGAGTGG CAAGTGTTAAAGTAGT CACTC C 
TAAAGTAGAAACTGGTG CATCAC CAGAGCATGTAT CAGCT CCAGCAGTT C 
CTGTGACTACGACTTGAACAGCTACAGACAGTAAGTTACAAGCGACTC^ 
GTTAAGAG CGTTCCGGTAGCACAAAAAGCT C CAACAGCAACACCGGT AG C 
ACAACCAGCTTCAAGAACAAATGCAGTAGCTGCACATCCT 
GG CTCCAACCTCATGTTGCAGCriT'ATAAA.GAA 

GGAGTTAATGAATTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGG 
TAAAGGTTT AGCAGTTGACTTTATTGTAgGTAAAAAC CAAGCACTTGGTA 
ATGAAGTTG CACAGT ACTCT ACACA^AATATGG CAGCaAATAACATTTCA 
TATGTTATCTGG CAACAAAAGTTTTACT CAAAT aCAAATAGTATTTATGG 
ACcTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTAcTGCCA 
ACCa CTATGAC CACGTT CACGT ATCATTTAACAAATaATATAAAAAAGGA 
AG CT aTTTGGCTTCTTTTTTATATG C CT^ CAAGGTT CTT 

ATATAATTTTTATTA 

SEQ ZD NO. 6904 
STRAIN H36B 
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PCT/US2003/026827 



Table 69: Comparative Sequences relating to SAG0032 



CTGATTT GGT AAAG CAAG ACAATAAAT CATCATATAc TGTG AAAT A 

TGGTGATACAcTAAGCGTTATTTCAGAAGCAATGTCaATTGATATGAATG 

TCTTAGCAAAAATTAATAACATTGCAGATATCAATCTTATTTATCcTGAG 

ACAACaCTGaCAGTAaCTTACGATCAGAAGAGTCATACTGCTACTTCAAT 

GAAA^TAGAAACACCAGCAACAAATGCTGCTGGTCAAACAACAGCTACTG 

TCGATTTGAAAACCAATCAAGTTTCTGTTGCAG^ 

AATAGAATTTCGGAAGGTATGACACCAGAAGCAGCAACAACGATTGTTTC 

GCCAATGAAGAC^TATTCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTAT 

TAGCACAAGGGCAAG CTGTTAGTCAAGCAGCAG CTAATGAACAGGTATCA 

CCAG CT C CTGTG AAGTCGATTACTT CAGAAGTTC CAG CAG CTAAAGAGGA 

AGTTAAACCAACTCAGACGTCAGTCAGTCAGTCAACAACAGTATCACCAG 

CTTcTGTTGCCGCTGAAACACCAGCTCCAGTAGcTAAAGTAGCACCGGTA 

AGAACTGTAGCAGCCCcTAGAGTGGCAAGTGTTAAAGTAGTCACTCcTAA 

AGTAGAAACTGGTGCATCACCAGAGCATGTATCAGCTCCAGCAGTTCCTG 

TGACTACGACTT CAACAG CTACAGACAGT AAGTTACAAG CGACTG AAGTT 

AAGAGCGTTCCGGTAGCACAAAAAGCTCCAACAGCAACACCGGTAGCACA 

ACCAG CTTCAACAACAAATG CAGTAGCTG CACAT CCTGAAAATG CAAGG C 

TCCAACCTCATGTTGCAG CT TATAAAG AAAAAGTAG CGT CAACTTATGGA 

GTTAATGAATTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAA 

AGGTTTAGGAGTTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATG 

AAGTTGCACAGTACTCT ACACAAAA t aTGG CAGCAAATAACATT T CATAT 

GTTATCTGG CaACAAAAGTTTT ACT CAAAT ACAAATAGTATTTAT GGACC 

TG CTAATACTTGGAATG CAATG CCAgAT CGTGGTGGCGTTACTG CCAAC C 

ACTATGACCACGTTCACGTATCaTTTAAC^^TAATATAAAAAAGGAAGC 

TATTTGGCTTCTTTTTTATATGCCTTGC^TA 

TAATTTTTATTA 

SEQ ID NO. 6905 
STRAIN 18RS21 
CTGATTTGGTAAAGCAAGACAAT 

AAAT CATCAT ATACTGTGAAATATGGTGAT ACAcTAAG C GTTATTTCAGA 

AGCAATGTCAATTGATATGAATGTCTTAGCAAAAaTAAATAACAT^ 

ATAT CAATCTT ATTTATCc TGAGACAACa CTGaCAGTAACT^^ CAG 

AAGAGTCATACTGCCaCTTCAATGAAAATAGAAACACCAGCAaC?\AATGC 

TGCTGGTCAaACAaCAGCTACTGTGGATTTGAAA 

TTGCAGACCAAAAAGTTTCTCTCAATACAATT^ 

GAAG CAG CAACAACGATTGTTT CG C CAATGAAGAC aTATTCTTcTGCGC C 
AG CITTGAAaTCAAAAGAAGTATTAG CACAAGAG CAAG CTGTTAGTCAAG 
CAGCAGCTAATGAACAGGTATCACCAG CT C CTGTGAAGTCGATTACTTCA 
GAAGTTCCAGCAGCTAAAGAGGAAGTTAAACCAACTCAGACGTCAGTCAG 
TCAGTCAACAAa^GTATCACCA,GCTTCrGTTGCCGCTGAAACACCAGCTC 
CAGTAG CTAAAGTAG CACCGGTAAGAACTGTAGCAGC CCCTAGAGTGGCA 
AGTGTTAAAGTAGT CACTCCTAAAGTAGAAACTGGTG CAT CACCAGAG CA 
TGTATCAGCTCCAGCAGTTCCTGTGACTACGACTTCACCAGCTACAG^ 
GT AAGTTACAAG CGACTGAAGTTAAGAGCGTT CCGGTAG CACAAAAAGCT 
CCAACAGCAACACCGGTAGCACAACCAGCTTCAACAACMATGCAGTAGC 
TG CACAT CCTGAAAATGCAGGG CT C CAAC CTCATGTTG CAGCTTATAAAG 
AAAAAGTAG CGT CAACTTATGGAGTTAATGAATTCAGTACATAC CGTGCG 
GGAGATCCAGGTGATCATGGTAAAGGTTTAGCAGTTGACTTTATTGTAGG 
TACTAAT CAAG CACTTGGTAATAAAGTTG CACAGTACT cT ACACAAAATA 
TGGCAGCAAATAACATTTCATATGTTATCTGGCAA 

AATACAAACAGTATTTATGGAC CTGCTAATACTTGGAATG CAATGCCAGA 
TCGTGGTGGCGTTACTGCCAACCACTATGACCACGTTCACGTATCATTTA 
ACAAATAATATAAAAAAGGAAGCTATTTGGCTTCTT^ 
AATAGACITTCAAGGTTCTTATATAATTTTTATTA 

SEQ ID NO. 6906 
STRAIN COH1 
CTGATTT 

GGTAAAG CAAGACAATAAAT CATCATATACTGTGAAATATGGTGATACAC 
TAAG CGTTATTTCAGAAG CAATGT CAATTGATATGAATGTCTT AG CAAAA 
ATTAATAACATTGCAGATATCA^TCITATTTATCCTGAGACAACACT 
AGTAACTTACGATCAGAAGAGT CATACTG CCACTT CAATGAAAATAGAAA 
CACCAGCAACAAATGCTGCTGGTCPAACIAAC^ 

ACCAATCAAG , 1T N 1TTGTTG CAGACCAAAAAGTTT c T CT CAATACAATTT C 

GGAAGGTATGACACCAGaaGCAGCAACAACGATTGTTTCGCCAATGAAGA 

CaTATTCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTAT^ 

CAAG CTGTT AGTCAAGT AG CAG CTAATGAACAGGTATCACCAGCT CCTGT 

GAAGT C^ATTACrTCAGAAGTTCCAGCAG CTAAAGAGGAAGTTAAAC CAA 

CTCAGACGTCAGT CAGT CAGTTAACAACAGTATCACCAG CTTCTGTTG CC 

GCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAACTGTAGC 

AG CCCCTAGAGTGGCAAGTG cT AAAGT AGTCACT C cTAAAGTAGAAACTG 

GTGCATCAC CAGAG CATGTATCAGCTCCAGCAGTT CCTGTGACTACGACT 

TCACCAGCTACAGACAGTA^GTTACAAGCGACTGAAGTTAAGAGCGTTCC 

GGTAGCACAAAAAGCTCCAACAGCMCACCGGTAGCACAACCAGC^ 

CAACAAATGCAGTAGCTGC^(^TCCTGAAAATGCAGGGCTCCAACCTCAT 

GTTGCAGCTT AT AAAGAAAAAGTAG CGT CAACTTATGGAGT^ 

C^GTACATACCXSTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGCAG 

TTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATGAAGT^ 

Ta CTCTACACAAAATATG<3 CAGCAAATAACATTT CATATGTTAT CTGGCA 

ACAAAAGTTTTATT CAAATACAAATAGTATTTATGGACCTGCTAATACTT 

GGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATGACCAC 

GTTCACGTAT CATTT AACAAATAAT ATAAAAAAGGAAG CTATTTGG CTT C 
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Table 69: Comparative Sequences relating to SAG0032 



TT Tl'TTATAT G C CTTGAATAG ACTTT CAAGGTT CTTATATAATTTTT ATT 
A 

SEQ ID NO. 6907 
STRAIN M732 

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATAT ACTGTGAAAT ATGGT 
GATACAnTAAG CGTT ATTT CAGAAG CAATGT CAATTG ATATG AATGT CTT 
AG CAAAAATT A*\TAACATTG CAGAT AT CAAT CTTATTTAT C CTG AGACAA 
CACTG ACAGTAACTT ACGAT CAGAAGAGT CA t ACTG CCACTT CAATG AAA 
ATAGAAACAC CAGCAACAAATG CTG CTGGT CAAACAACAG CTACTGT C G A 
TTTGAAAACC^TCAAGTTTTTGTTGCAGACCAAAAAGTTTCTCTCAATA 
CAATTTCGGAAGGTATGACACCAGAAGCAGGAACAACG^ 
ATGAAGACATATTCrTCTGCGCCAGCTTTGAAATCAAAAGAAGTATTAGC 
ACAAGAG CAAG CTGTTAGT CAAGTAG CAG CTAATG AACAGGTATCAC CAG 
CT C CTGTGAAGT CG ATTACTT CAGAAGTTC CAG CAGCTAAAG AG G AAGTT 
AAACCAACTCAGACGTCAGTCAGTCAGTTAACAAf^GTATCACCAGCTTC 
TGTTGCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAA 
CTGTAGCAGCCCCTAGAGTGGCAAGTGCTAAAGTAGTCACTCCTAAAGTA 
GAAACTGCTGCATCACCAGAGCATGTATCAGCTCCAGCAGTTCCTGTGAC 
TACGACTT CACCAG CTACAGACAGT AAGTT ACAAG CG ACTGAAGTTAAG A 
GCGTTCCGGTAGCAC^W^GCTCCAAC^GCAaCACCGGTAG(^C^CCA 
GCTTCAACAACAAATGCAGTAGCT^ 

AC CT CATGTTGCAG CTT ATAAAGAAAAAGTAG CGT CAACTTATGGAGTTA 

ATGAATT CAGTACATAC CGTGCGGG AGAT C CAGGTGATCATGGTAAAGGT 

TTAG CAGTTGACTTTA t tgtaggtaaaaaccAAGCACTTGGTAATGAAGT 

TGCACAGTACTcTACACTVAAATATGGCAGCAAATAAC 

TCTGG CAACAAAAGTTTTATTCAAATACAAATAGT ATTT ATGGACCTG CT 

AATACTTGCAATGCAATGCCAGATCGTGGTGCCX3TTACTGCCAACCACTA 

TGACCACGTTCACGTAT CATTTAACAAATAATATAAAAAAGGAAG CTATT 

TGGCTTCTTTTTTATATGCCTTGAATAGACTITTCAAGCTT CTTATATAAT 

TTTTATTA 

SEQ ID NO. 6908 
STRAIN M78 1 

CTGATTTGGTAAAG CAAGACAATAAATCATCATATACTGTGAAATATGGT 
GATACACTAAGCGTTATTT CAGAAGCAATGTCAATTG ATATG AATGT CTT 
AG CAAAAATTAATAACATTG CAGATAT CAATCTT'ATTTATCCTGAGACAA 
CACTGACAGTAACrrTACGATCAGAAGAGTCATACr 

ATAGAAACACCAGCAACAAATCCTGCTGCTCAAACAACAGCTACTGTCGA 
TTTG AAAAC CAAT CA^GTTTTTGTTGCAGACCAAAAAGTTT CTCT CAAT A 
CAATTTCGGAAGGTATGACACCAGAAG CAGCAACAACGATTGTTTCGCCA 
ATGAAGACATATT CTT CTG CGC CAG CTTTGAAAT CAAAAGAAGTATTAG C 
ACAAGAG CAAGCTGTTAGT CAAGTAG CAG CTAATGAACAGGTAT CAC CAG 
CT CCTGTGAAGTCGATTACTTCAG AAGTT C CAGCAG CTAAAGAGGAAGTT 
AAAC CAACT CAGACGTCAGTCAGT CAGTTAACAACAGTAT CACCAG CTT C 
TGTTGCCGCTGAAACACCAG CTCCAGTAGCTAAAGTAGCACCGGTAAGAA 
CTGTAGCAG C CCCTAGAGTGGCAAGTG CTAAAGTAGTCACTCCTAAAGTA 
GAAACTGGTG CATCAC CAGAGCATGTATCAGCTC CAG CAGTT CCTGTGAC 
TACGACTTC^CCAGCTACAGACAGTaaGTTACAAGCGACTGAAGTTAAGA 
GCGTTCCGGTAGC!ACAAAAAGCTCCAACAGCAACACCGGTAGCACAACCA 
GCTTCAACAACAA^TG CAGTAG CTGCA(^TCCTGAAAATG CAGGG CT CCA 
AC CTCATGTTG CAG CTTATAAAGAAAAAGTAG CGT CAACTTATGGAGTTA 
ATGAATTCAGTACATACCGTGCGCGAGATCCAGGTGATCATGGTAAAGGT 
TTAG CAGTTGACTTTATTGTAGGTAAAAAC CAAG CACTTGGTAATGAAGT 
TG CACAGTACTCTACACAAAATATGGCAG CAAATAACATTTCATATGTTA 
TCTGGCAACAAAAGTTTTATTCAAATACAAATAGTATT^ 
AATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTA 
TGACCACGTT CACGTAT CATTTAACAAATAATATAAAAAAGGAAG CT aTT 
TGGC"lTC"riU M iTTATATGCCITGAATAgACTTTCAAGGTTCrTATATAAT 
TTTTATTA 

SEQ XD NO. 6909 
STRAIN CJB 110 

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATAT ACTGTGAAA 

TATGGTGATACACT AAG CGTTATTT CAGAAGCAATGT CAATTGATATGAA 

TGTCTTAGCAAAAATTAAT AACATTGCAGATAT CAAT CITATTT ATC CTG 

AGACAACACTGACAGTAACTTACGATCIAGAAGAGTCATACT 

ATGAAAATAGAAACACCAGCMCAAATGCTGCTGGTCAAACACCAGCTAC 

TGTGGATTTGAAAACCAATCAAGTTTcTGTTGCAGACCAAAA^ 

TCAATACAATTTCGGAAGGTATGACACCAGAAGCAGC^ 

T CGCCAATG AAGACATATT CTTCTG CG CCAGCTT^ CAAAAGAAGT 

ATTAG CACAAGAG CAAGCTGTTAGT CAAG CAG CAG CTAATGAACAGGTAT 

CAACAGCTCCTGTGAAGTCGATTACTT CAGAAGTTC 

GAAGTTAAACCAACTCAGACGTCAGTCAGTCAGTCAACAA 

AgCTTCTGTTGCCG CTG AAACACCAG CT C CAGTAG CT AAAGT AGCAC CGG 

TAAgAACTGTAG CAG CC CCTAG AGTGGCAAGTGTTAAAGTAGTCACT CCT 

AAAGTAGAAACTGGTGCATCAC CAGAG CATGTAT CAG CT C CAG CAGTTC C 

TGTGACTACGACTTCAAC^GcTACAGACAGTaAGTTaCAAGCGACTGAAG 

TTAAGAGCGTTCCGGTAGCACAAAAAGCTCCAACAGCAACACCGGTAGCA 

CAACCAGCTTCAACAAGAAATCCAGTAGCTGC^ 

GCTCCAACCT CATGTTG CAG CTTAT aAAGAAAAAGTAGCGTCAACTTATG 
GAGTTAATG AATTCAGT ACATa C CGTGCAGGTGAT C CAgGTGAT CATGGT 
AAAGGTTTAGCAGTcGACTTTATTGTAgGTAAAAACCAAGCACTTGGTAA 
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TG AAGTTG GAGAGTACT CT ACACAAAATATGG CAG CAAAT AACATTT CAT 
ATGTTATCTGGCAACAAAAGTTTTACTCAAATACAAATAGTATTTATGGA 
CCTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAA 
CCATTATGAC CATGTT CACGT AT CATTTAACAAATAAT ATAAAAAAGGAA 
GCTATTTGG CTT CTTTTTT ATATG C CTTG AATAG AC t TTCAAGGTT CTT A 
TATAATTTTTATTA 

SEQ ID NO. 6910 
STRAIN 1169NT 
CTGATTTG 

GTAAAGCMGACAATAAATCATCATATACTGTGAAATATGGTGATACACT 
AAGCGTTATTTCAGAAGCAATGTCAATTGATATGAATGTCTTAGCAAAAA 
TTAATAACATTG CAG AT AT CAATCTT ATTTAT C C TGAGACAACACTGACA 
GTAACTTACG AT CAg AAGAGTCAT ACTG C CACTTCAATGAAAATAGAAAC 
AC CAG CAACAAATG CTG CTGGT CAAACAACAGCTACTGTGGATTTGAAAA 
CCAATGAAGTTTCTGTTGCAGACGAAAAAGTTTCTCT 
GAAGGTATGACACCAGAAG CAg CAACAACG ATTGTTTCGC CAATGAAGAC 
ATATTCTTCTGCGC CAG CTTTg AAAT CAAAAGAAGTATTAGCACAAG AG C 
AAGCTGTTAGTCAAGC^GCAGCrAATGAAC^GGTATCACCAGCTCCTGTG 
AAGTCGATTACTTCAgAAGTT C CAg CAGCTAAAGAGG AAGTTAGAC CAa C 
TcAGACGTCAGTCAGTCAGTCMCMG^GTATCACC^gCTTCTGTTGCCG 
CTGAAACAC CAG CT C CAGT AGCTAAAGTAG CAC CGGT AAGAACTGTAG CA 
GCCCCAGCCCCrAGAGTGGCAAGTGCTAAAGTAGTCACTCCTAAAGTAGA 
AAcTGGTGCATCACCAGAGCATGTACCAGCTCCAGC^GTTcCTGTGACTA 
cG ACTTCAACAG CTAC aGACAaTaAGTTACAAG CGACTGAAGTTAAgAG C 
GtTCCGGTgGCACAAAAAGCTCCAACAGCAAGACCGGTaGCACA 
TTcAACAACAAATGCAGTAGcTGCACATCCTGAAAATGCA 
CTCATGTTG CAGCTTAT AAAGAAAAAGTAG CGTCAACTTATGGAGTTAAT 
GAATTCAGTACATaC CGTG CGGGAGATCCAGGTG ATCATGGT AAAGGTTT 
AG CAGTTGACTTTATTGTagGTAAAAACCAAG CACTTGGTAATGAAGTTG 
CACAGTACTCTACACAAAATATGG CAGCAAAT^ CATATGTTAT C 

TGGCAACAAAAGTTTTACTCAAATACAAATAGTATTTATG^ 
TACTTGGAATGG^TGCCAGATCGTGGTGGCGTTACTGCCAACGA.CTATG 
AC CACGTTCACGTAT CATTTAACAAAT AATATAAAAAAGGAAGCTATTTG 
G CTTCTTTTTTATATG C CTTGAAT AGACTTT CAAGG t TCTTATATAATTT 
TTATTA 

SEQ XD NO. 6911 
STRAIN JM9 1300 13 

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATATACT 

GTGAAATATGGTGATACACTAAGCGTTATTTCAGAAGCAATGTCAATTGA 

TATGAATGT CTTAG CAAAAATAAAT AACATTG CAGAT ATCAAT CTTATTT 

AT CcTGAGACAACACTGACAGTAACTTACGAT CAGAAGAGTCATACTGC C 

ACTTCAATGAAAATAGAAACAC CAG CAACAAATG CTG CTGGT CAAACAAC 

AGCTACTGTGGATTTGAAAACCAATC7VA3TTTCTGCT 

TTTCT CTCAATACAATTT CG GAAGGTATGACACCAGAAG 

ATTGTTT CG CCAATG AAGACATATT CTT CTG CG C CAG CTTTGAAATCAAA 

AGAAGTATTAGCACAAGAGCAAGCTGTTAGTCAAGCAGCAGCTAATGAAC 

AGGTATCACCAG CT C CTGTGAAGTCGACTACTTCAGAAGTTCCAGCAG CT 

AAAGAGGAAGTTAAACCAACTa^GACGTCAGTCAGT(^GTCAACAACAGT 

ATCACCAgCTTCTGTTGCCGCTGAAACACCAGCTCCAGTAGCT 

CACCGGTAAGAACTGTAGCAG C C CCTAgAGTGGCAAGTGTTAAAGTAGTC 

ACT C CTAAAGTAGAAACTGGTG CAT CACC AGAG CATGTATCAG CT CCAG C 

AGTT C CTGTGACTACGACTT CACCAG CTACAG aCAGT AAGTTACAAG CGA 

cTGAAGTTAAGAGCGTTCCGGTAG CACAAAAAG CTCCAACAG CAACACCG 

GTAGCaCAACCAGCTTCAACAACAAATGCAGTAGCTX^ 

TGCAGGGCTCCAACCTCATGTTGCAGCTTATAAAGAAAAAGTAGCGTO^ 

CITATGGAGTTAATGAATTCAGTACATACCGTGCGGGAGATCCAgGTGAT 

CATGGTAAAGGTTTAG CAGTTGACTTTATTGTAGGTACTAATCAAG CACT 

TGGTAATAAAGTTG CACAGTACTCTACACAAAATATGG CAGCAAATAACA 

TTTGATATGTTATCTGGCAACAAAAGTTTTACT 

TATGGACCTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTAC 
TG CCAACCACTATG AC CACGTT CACGTAT CATTTAACAAATAAT ATAAAA 
AAGGAAG CTATTTGG CTT CTTTTTTATATG CCTTG AATAGACTTT CAAGG 
TT CTT AT AT AATTTTTATTA 

PRETTY of: /biotmp/msal67919 . 2 { * } March 11, 2003 08:55 .. 

1 50 

msal 67919 . 2 { 322_C0H1 } 

msal67919.2{322_M78l} 

msal67919.2{322_M732} 

msal67919.2{322_18RS2l} 

msal67919.2{322_2603} atgaataaaa aggtactatt gacatcgaca atggcagctt cgctattatc 

msal67919.2{322_JM9130013} 

msal67919.2{322_090} 

msal 67 919 . 2 { 322_CJB110 } : 

msal67919.2{322_A909} 

msal67919.2{322_H36B} 

maal67919.2{322_1169NT} 

Consensus ********** ********** ********** ********** ********** 

51 100 
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msal67919.2{322_COHl) 

msal67919 . 2{322_M781) ~ 

msal67919 . 2{ 322_M732 } ~ 

msal67919.2{322_18RS2l} 

msal67919.2{322_2603} agtcgcaagt gttcaagcac aagaaacaga tacgacgtgg acagcacgta 

msal67919 . 2 { 322_JM913 0013 } • 

msal67919.2{322_090} 

msal67919 .2{322_CJB110} 

maal67919.2{322_A909} 

msal67919.2{322_H36B} 

msal67919.2{322_1169NT} 

Consensus ********** ********** ********** ********** ********** 

101 150 

msal67919.2{322_C0Hl} — ct gatttggtaa agcaagacaa taaatcafcca 

msal67919.2{322_M78l} ct gatttggtaa agcaagacaa taaatcatca 

msal67919 .2{322__M732} » ct gatttggtaa agcaagacaa taaatcatca 

msal67919 .2{322_18RS21} ~~ — ct gatttggtaa agcaagacaa taaatcatca 

msa!67919. 2 { 322^2603} ctgtttcaga ggtaaaggct gatttggtaa agcaagacaa taaatcatca 

msal67919.2{322_JM9130013 J ct gatttggtaa agcaagacaa taaatcatca 

msal67 919 .2{322_090) 

msal67919 .2{322_CJB110} ct gatttggtaa agcaagacaa taaatcatca 

msa!67919 . 2{322_A909} ct gatttggtaa agcaagacaa taaatcatca 

msal67919.2{322_H36B} ct gatttggtaa agcaagacaa taaatcatca 

msal67919 . 2{322_1169NT} ct gatttggtaa agcaagacaa taaatcatca 

Consensus ********** ******** 



msal67919 . 2 { 322_C0H1 } 
msal67919 . 2 { 322_M78 1 } 
msa!67919 . 2 { 322_M732 } 
msal67919 . 2 { 322_18RS2 1 } 
msal67919 . 2 {322_2603 } 
msa!67919.2{322_JM9130013} 
msal67919.2{322_090} 
msal67919 . 2 {322_CJB110 } 
msal67919 . 2 {322_A909 } 
msal67919 . 2 { 322_H36B } 
msal67919.2{322_1169NT} 
Consensus 



151 

tatactgtga 
tatactgtga 
tatactgtga 
tatactgtga 
tatactgtga 
tatactgtga 



aatatggtga 
aatatggtga 
aatatggtga 
aatatggtga 
aatatggtga 
aatatggtga 



tacactaagc 
tacactaagc 
tacantaagc 
tacactaagc 
tacactaagc 
tacactaagc 



gttatttcag 
gttatttcag 
gttatttcag 
gttatttcag 
gttatttcag 
gttatttcag 



200 

aagcaatgtc 
aagcaatgtc 
aagcaatgtc 
aagcaatgtc 
aagcaatgtc 
aagcaatgtc 



tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc 

tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc 

tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc 

tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc 



msal 67 9 19 . 2 f 322_C0H1 } 
msal67919 . 2 { 322_M781 } 
msal67919.2{322_M732} 
msal67919 . 2 { 322_18RS21 } 
msal67919 .2 {322_2603 } 
msal67919.2{322_JM9130013} 
msal67919 . 2 {322_090 } 
msal 67 919 - 2 {322_CJB110 } 
msal67919.2{322_A909} 
msal 67 9 1 9 . 2 { 3 2 2_H3 6B } 
msal67919 . 2 {322_1169NT} 
Consensus 



201 

aattgatatg 
aattgatatg 
aattgatatg 
aattgatatg 
aattgatatg 
aattgatatg 



aatgtcttag 
aatgtcttag 
aatgtcttag 
aatgtcttag 
aatgtcttag 
aatgtcttag 



caaaaattaa 
caaaaattaa 
caaaaattaa 
caaaaataaa 
caaaaataaa 
caaaaataaa 



taacattgca 
taacattgca 
taacattgca 
taacattgca 
taacattgca 
taacattgca 



250 

gatatcaatc 
gatatcaatc 
gatatcaatc 
gatatcaatc 
gatatcaatc 
gatatcaatc 



aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc 
aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc 
aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc 
aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc 



msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919. 
msal67919.2{322 
msal67919 
msal67919.2{ 

msal67919. 

msal67919. 
msal67919.2{ 



2{322_COHl} 
2{322_M78lJ 
2{322_M732} 
322_18RS21} 
2{322_2603} 
_JM9130013} 
.2{322_090} 
322_CJB110} 
2{322_A909} 
2{322_H36B 
322_1169NT 
Consensus 



251 

ttatttatcc 
ttatttatcc 
ttatttatcc 
ttatttatcc 
ttatttatcc 
ttatttatcc 

ttatttatcc 
ttatttatcc 
ttatttatcc 
ttatttatcc 



TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
TGAGACAACA 
********** 



CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
CTGACAGTAA 
********** 



CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
CTTACGATCA 
********** 



300 

GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
GAAGAGTCAT 
********** 



msal 67919.2(32 2_C0H1 } 
msal67919.2{322_M78lj 
msal67919.2{322_M732] 
msa!67919 . 2 { 322_18RS2 1 \ 
msal67919.2{322J2603. 
msal67919 . 2 { 322_JM9130013 
msal67919 . 2 {322__090 ] 
msal67919 . 2 { 322_CJB110 " 
msal67919.2{322_A909] 
msal67919-2{322_H36B] 
msal67919 . 2 {322_1169NT] 
Consensus 



301 

ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCcACTT 
ACTGCtACTT 
ACTGCtACTT 
ACTGCcACTT 



CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 
CAATGAAAAT 



*****_**** ********** 



AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
AGAAACACCA 
********** 



GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 
GCAACAAATG 



350 

CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 
CTGCTGGTCA 



********** ********** 
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msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919. 
msal67919.2{322 
msal67919 
msal67919.2{ 

msal67919. 

msal67919. 
msal67919 .2{ 



2{322_COHl] 
2{322_M78l! 
2{322_M732; 
322_18RS21 ! 
2{322_2603' 
JM9130013^ 
2 {322^090} 
322_CJB110) 
2{322_A909} 
2{322_H36B} 
322__1169NT} 
Consensus 



351 

AACAaCAGCT 
AACAaCAGCT 
AACAaCAGCT 
AACAaCAGCT 
AACAaCAGCT 
AACAaCAGCT 
AACAC CAGCT 
AACAc CAGCT 
AACAaCAGCT 
AACAaCAGCT 
AACAaCAGCT 
****_***** 



ACTGTcGATT 
ACTGTcGATT 
ACTGTcGATT 
ACTGTgGATT 
ACTGTgGATT 
ACTGTgGATT 
ACTGTgGATT 
ACTGTgGATT 
ACTGTcGATT 
ACTGTcGATT 
ACTGTgGATT 
*****_**** 



TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
TGAAAACCAA 
********** 



TCAAGTTT tT 
TCAAGTTTtT 
TCAAGTTT tT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTcT 
TCAAGTTTCT 
********_* 



400 

GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
GTTGCAGACC 
********** 



msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919. 
msalS7919.2{322 
msal67919 
msal67919.2{ 

msal67919 

msal67919 
msal67919.2{ 



2f 32 2_COHl) 
2{322__M781} 
2{322_M732' 
322_18RS2i; 
2{32 2_2603^ 
_JM9130013' 
.2{322_090] 
322_CJB110} 
2f322_A909} 
2{322_H36B} 
322_1169NT} 
Consensus 



401 

AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
AAAAAGTTTC 
********** 



TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
TCTCAATACA 
********** 



ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
ATTTCGGAAG 
********** 



GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
GTATGACACC 
********** 



450 

AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
AGAAGCAGCA 
********** 



451 500 

msal67919 .2{322_COHl} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_M78l} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_M732} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67 919 .2(32 2_18RS2l} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_2603) ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_JM9130013} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_090} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919.2{322_CtTB110} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msa!67919.2{322_A909} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919 .2{322_H36Bj ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

msal67919 .2{322_1169NT} ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA 

Consensus ********** ********** ********** ********** ********** 



501 

msal67919.2{322_COHl} ATCAAAAGAA GTATTAGCAC 

msal67919 .2(322_M78l} ATCAAAAGAA GTATTAGCAC 

msal67919 .2{322_M732} ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_18RS2lJ ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_2603) ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_JM9130013} ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_090} ATCAAAAGAA GTATTAGCAC 

msal67919.2{322__CJB110) ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_A909} ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_H36B} ATCAAAAGAA GTATTAGCAC 

msal67919.2{322_1169NT} ATCAAAAGAA GTATTAGCAC 

Consensus ********** ********** 



AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGaGCAAGC 
AAGgGCAAGC 
AAGgGCAAGC 
AAGaGCAAGC 



TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 
TGTTAGTCAA 



550 

GtAGCAGCTA 
GtAGCAGCTA 
GtAGCAGCTA 
GcAGCAGCTA 
GcAGCAGCTA 
GcAGCAGCTA 
GCAGCAGCTA 
GcAGCAGCTA 
GcAGCAGCTA 
GcAGCAGCTA 
GcAGCAGCTA 



***_****** ********** *w******** 



551 600 

msal67919.2{322_COHl) ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

meal67919.2{322_M781} ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919.2{322_M732} ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919 .2{322_18RS2l) ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919 .2{322_2603 } ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

tnsal67919.2{322_JM9130013) ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919.2{322_090} ATGAACAGGT ATCAaCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919 .2{322_CJB110} ATGAACAGGT ATCAaCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919 .2{322_A909} ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919 .2{322_H36B} ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

msal67919.2{322_1169NT) ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA 

Consensus ********** ****-***** ********** ********** ********** 



msa!67919 . 2 { 322_COHl } 
msal67919.2{322__M78l} 
msal67919 . 2 {322_M732 j 
msal67919 . 2 { 32 2_18RS21 
msal67919 . 2 { 322_2603 
msal67919 . 2 {322_JM9130013 ' 
msal67919. 2 { 322_090 
msal67919 - 2 { 322_CJB110 1 
msal67919.2{322_A909} 
msal67919 . 2 { 322_H36B} 
Tnsal67919.2{322_1169NT} 
Consensus 



601 

GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 
GCAGCTAAAG 



AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAa 
AGGAAGTTAg 



********** *********_ 



ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
ACCAACTCAG 
********** 



ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
ACGTCAGTCA 
********** 



650 

GTCAGTtAAC 
GTCAGTtAAC 
GTCAGTtAAC 
GTCAGTcAAC 
GTCAGTcAAC 
GTCAGTCAAC 
GTCAGTcAAC 
GTCAGTcAAC 
GTCAGTcAAC 
GTCAGTcAAC 
GTCAGTcAAC 
******_*** 



992 
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Table 69: Comparative Sequences relating to SAG0032 



msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919 
msal67919.2{322 
msa!67919 
msal67919.2{ 

msal67919. 

msa!67919 . 
msa!67919.2{ 



2{322_COHl} 
2{322_M78l} 
2{322_M732) 
322_18RS2l} 
2{322_2603} 
JM9130013} 
2{322_090} 
322_CJB110} 
2{322_A909} 
2{322__H36B} 
322_1169NT} 
Consensus 



651 

AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
AACAGTATCA 
********** 



CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
CCAGCTTCTG 
********** 



TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 
TTGCCGCTGA 



AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 
AACACCAGCT 



********** ********** 



700 

CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
CCAGTAGCTA 
********** 



701 

msa!67919 . 2 { 322__COHl } AAGTAGCACC GGTAAGAACT 

msal67919 . 2 {322__M781 } AAGTAGCACC GGTAAGAACT 

msalS7919. 2{322__M732} AAGTAGCACC GGTAAGAACT 

msal67919.2{322_18RS2l) AAGTAGCACC GGTAAGAACT 

msal67919.2{322_2603) AAGTAGCACC GGTAAGAACT 

msal67919.2{322_JM9130013} AAGTAGCACC GGTAAGAACT 

msa!67919 .2 {322_090} AAGTAGCACC GGTAAGAACT 

msa!67919 .2{322_CJB110} AAGTAGCACC GGTAAGAACT 

msal67919.2(322_A909} AAGTAGCACC GGTAAGAACT 

msal67919.2{322_H36B} AAGTAGCACC GGTAAGAACT 

msal67919 . 2 { 322_1169NT} AAGTAGCACC GGTAAGAACT 

Consensus ********** ********** 



GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAG CAGCCCCTAG 

GTAGcagccc CAGCCCCTAG 

**** ********** 



751 

msal67919.2{322_COHl} GcTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2f 322_M78l} GcTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2{322_M732} GcTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal 67 9 1 9 . 2 { 3 2 2_1 8 RS2 1 } GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal6791 9. 2 {322^2603} GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msa!67919.2{322_JM913 0013j GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2{322_090) GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2{322_CJB110} GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2f 322_A909} GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919.2(322_H36B} GtTAAAGTAG TCACTCCTAA AGTAGAAACT 

msal67919 .2{322_1169NT} GcTAAAGTAG TCACTCCTAA AGTAGAAACT 

Consensus *-******** ********** ********** 



msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919. 
msal67919.2{322 
msal67919 
msal67919.2{ 

msa!67919 

msal67919. 
msal67919.2{ 



2{322_COHl} 
2{322_M781} 
2{322_M732} 
322_18RS2l} 
2{322_2603} 
_JM9130013} 
.2{322_090j 
322_CJB110) 
2{322_A909} 
2{322_H36B} 
322_1169NT} 
Consensus 



801 

At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
At CAGCTCCA 
AcCAGCTCCA 
*_******** 



GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
GCAGTTCCTG 
********** 



TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
TGACTACGAC 
********** 



GGTGCATCAC 
GGTG CATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
GGTGCATCAC 
********** 



TTCAc CAGCT 
TTCAcCAGCT 
TTCAcCAGCT 
TTCAcCAGCT 
TTCAcCAGCT 
TTCAcCAGCT 
TTCAaCAGCT 
TTCAaCAGCT 
TTCAaCAGCT 
TTCAaCAGCT 
TTCAaCAGCT 
****_***** 



750 

AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
AGTGGCAAGT 
********** 

800 

CAGAG CATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
CAGAGCATGT 
********** 

850 

ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAgTA 
ACAGACAaTA 
*******_** 



851 

msal67919.2(322_COHl} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_M78l} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_M732} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_18RS2l} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_2603 \ AGTTACAAGC GACTGAAGTT 

msal67919.2{322_JM9130013} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_090} AGTTACAAGC GACTGAAGTT 

msal67919 .2{322_CJB110} AGTTACAAGC GACTGAAGTT 

msal67919.2{322_A909) AGTTACAAGC GACTGAAGTT 

msal67919.2{322__H36B| AGTTACAAGC GACTGAAGTT 

msal67919 .2{322_1169NT} AGTTACAAGC GACTGAAGTT 

Consensus ********** ********** 



AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 
AAGAGCGTTC 



CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTaGCACA 
CGGTgGCACA 



********** ****_***** 



900 
AAAAGCTCCA . 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
AAAAGCTCCA 
********** 



901 950 

msal67919.2{322_C0Hl} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919.2{322_M78l) ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

nisal67919.2{322_M732} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msa!67919.2{322_18RS2l} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919. 2 {322_2603} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal 67919.2(322 JM9130013) ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal6791972{322_090} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919.2{322_CJB110} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919.2{322_A909} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919.2{322_H36B} ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 

msal67919.2{322_1169NT) ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 



993 



WO 2004/018646 
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Table 69: Comparative Sequences relating to SAG0032 



Consensus 



********** ********** ********** ********** ********** 



msal67919.2{322_COHl} 
msal67919 . 2 (322_M78l) 
msal67919 . 2 { 322_M732 } 
msal67919.2{322_18RS21~ 
msal67919.2{322_2603 
msal67919.2{322_JM9130013 
msal67919 . 2 {322_090 
msal67919.2{322_CJB110} 
msal67919 .2{322_A909} 
msal67919.2{322_H36B} 
rasal67919.2{322_1169NT} 
Consensus 



951 

ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
ACATCCTGAA 
********** 



AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAgGgC 
AATGCAaGgC 
AATGCAaGgC 
AATGCAgGaC 
******_*_* 



TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
TCCAACCTCA 
********** 



TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
TGTTGCAGCT 
********** 



1000 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
TATAAAGAAA 
********** 



1001 

msal67919.2{32 2_COHl } AAGTAGCGTC 

msal67919.2{322_M78l} AAGTAGCGTC 

msal67919.2{322_M732) AAGTAGCGTC 

msal67 9 19 . 2 { 3 2 2_18RS2 1 } AAGTAGCGTC 

msal67919.2{322_2603} AAGTAGCGTC 

msal67919.2{322_JM9130013} AAGTAGCGTC 

msal67919.2{322_090} AAGTAGCGTC 

ms al 67 9 19 . 2 { 3 2 2_C JB 110} AAGTAGCGTC 

msal67919.2{322_A909} AAGTAGCGTC 

msal 6 7 9 1 9 . 2 { 3 2 2_H3 6B } AAGTAGCGTC 

msal67919.2{322_1169NT} AAGTAGCGTC 

Consensus ********** 



AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 
AACTTATGGA 



GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 
GTTAATGAAT 



TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 
TCAGTACATA 



1050 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCaGGt 
CCGTGCaGGt 
CCGTGCgGGa 
CCGTGCgGGa 
CCGTGCgGGa 



********** ********** ********** ******_*★_ 



1051 

msal67919.2(322_COHl} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_M78l} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_M732} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_18RS2l} GATCCAGGTG ATCATGGTAA 

msal6791 9. 2 {322^2603} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_JM9130013} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_090} GATCCAGGTG ATCATGGTAA 

msal67919.2{322_CJB110j GATCCAGGTG ATCATGGTAA 

msal67919.2{322_A909) GATCCAGGTG ATCATGGTAA 

msal67919.2{322_H36B} GATCCAGGTG ATCATGGTAA 

msal67 919.2{322_1169NT} GATCCAGGTG ATCATGGTAA 

Consensus ********** ********** 



AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 
AGGTTTAGCA 



GTtGACTTTA 
GTtGACTTTA 
GTtGACTTTA 
GTtGACTTTA 
GTtGACTTTA 
GTtGACTTTA 
GTcGACTTTA 
GTcGACTTTA 
GTtGACTTTA 
GTtGACTTTA 
GTtGACTTTA 



********** **„******* 



1100 
TTGTAGGTAa 
TTGTAGGTAa 
TTGTAGGTAa 
TTGTAGGTAc 
TTGTAGGTAc 
TTGTAGGTAc 
TTGTAGGTAa 
TTGTAGGTAa 
TTGTAGGTAa 
TTGTAGGTAa 
TTGTAGGTAa 
*********_ 



1101 

msal67919.2{322_COHl} aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_M78l) aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919 . 2 (322_M732 } aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_18RS2l} t AAt CAAGCA CTTGGTAATa AAGTTGCACA 

msal67919.2{322_2603} t AAt CAAGCA CTTGGTAATa AAGTTGCACA 

msal67919 . 2 {322_JM9 130013} t AAt CAAGCA CTTGGTAATa AAGTTGCACA 

msal67919.2{322_090) aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_CJB110} aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_A909} aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_H36B} aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

msal67919.2{322_1169NT} aAAcCAAGCA CTTGGTAATg AAGTTGCACA 

Consensus -**-****** *********- ********** 



GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 
GTACTCTACA 



1150 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 
CAAAATATGG 



********** ********** 



1151 1200 

msa!67919 -2{322_COHl} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT 

msal67919.2{322_M78l} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT 

msal67919.2{322_M732} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT 

msal67 919.2{322_18RS2l} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67919 .2 {322_2603 } CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67919.2{322_JM9130013} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67919.2{322_090} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67 919.2{322_CJB110} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67919.2{322_A909} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67919.2{322_H36B} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

msal67 919.2{322_1169NT} CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT 

Consensus ********** ********** ********** ********** ***_****** 



1201 1250 

msal67919.2{322_COHl) ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919 .2{322_M781) ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919.2{322_M732} ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919.2{322_18RS2l} ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919.2{322_2603} ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

tnsal67919 .2{322_JM9130013) ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919 .2 {322_090} ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919.2{322_CJB110) ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919.2{322_A909) ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 

msal67919 .2{322_H36B} ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 
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msal67919.2{322_JL169NT} ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 
Consensus *****-**** ********** ********** ********** ********** 



msal67919 . 

msal67919. 

msal67919. 
msal67919.2{ 

msal67919. 
msal67919.2{322 
msal67919 
msal67919 .2{ 

msal67919. 

msal67919. 
msal67919.2{ 



2{322_COHl} 
2{322_M78l} 
2{322_M732} 
322_18RS2l} 
2{322_2603} 
JM9130013} 
2{322_090} 
322_CJB110} 
2{322_A909} 
2{322_H36B} 
322_1169NT} 
Consensus 



1251 

TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 
TGGTGGCGTT 



ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 
ACTGCCAACC 



AcTATGACCA 
ACTATGACCA 
AcTATGACCA 
AcTATGACCA 
AcTATGACCA 
AcTATGACCA 
AtTATGACCA 
AtTATGACCA 
AcTATGACCA 
AcTATGACCA 
AcTATGACCA 



CGTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 
tGTTCACGTA 
t GTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 
CGTTCACGTA 



1300 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 
TCATTTAACA 



********** ********** *_******** 



.********* ********** 



msal67919. 

msal67919. 

msal67919. 
msal67919.2{ 

msal6791'9. 
msal67919.2{322 
msal67919 
msal67919.2{ 

msal67919 

msal67919 
msal67919.2{ 



2{322_COHl} 
2{322_M781} 
2{322_M732} 
322_18RS2l} 
2{322_2603} 
JM9130013} 
2{322_090} 
322_CJB110} 
2{322_A909} 
2{322_H36B) 
322_1169NT} 
Consensus 



1301 

AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 
AATAATATAA 



AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 
AAAAGGAAGC 



TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 
TATTTGGCTT 



CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA 
CTTTTTTATA. 



1350 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGaAT 
TGCCTTGcAT 
TGCCTTGcAT 
TGCCTTGaAT 



********** ********** ********** ********** *******_** 



1351 1382 

msal67919.2{322_COHl} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

tnsal67919.2{322_M78l} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal 67919.2(32 2_M7 32} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919 .2 {322_18RS2ll AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919.2{322_J2603) AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msa!67919»2{322_JM9130013} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919.2{322_090} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919.2{32 2_C JB1 1 0 } AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919 . 2{322_A909} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919.2{32 2_H3 6B } AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

msal67919.2{322_1169NT} AGACTTTCAA GGTTCTTATA TAATTTTTAT TA 

Consensus ********-** ********** ********** ** 



SEQ ID NO. 6912 
STRAIN 2603 frame: 1 

MNKKVTiLTSTMAAS LLS VAS VQAQETDTTWTARTVSETVKADLVKQDNKS S YTVKYGDTL S 
VI SEAMS IDMNVLAKINNI ADI NL I YPETTLTVT YDQKSHTATSMKI ETPATNAAGQTTA 
TVDLKTNQVSVADQKVSLNT I SEGMTPEAATTI VSPMKTYSSAPALKSKEVLAQEQAVSQ 
AAANEQVSPAP VKS I TSEVPAAKEE VKPTQTS VS QSTTVS PAS VAAETPAPVAKVAPVRT 
VAAPRYASVKWTPKVETGASPEHVSAPAVPVTTTSPATDSKLQATEVKBVPVAQKA 
TPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEFSTYRAGDPGDHGKGLxAVD 
FI VGTNQALGNKVAQYSTQNMAANNI SYVI WQQKFYSNTNS I YGPANTWNAMPDRGGVTA 
NHYDHVHVSFNK. YKKGSYLASFLYALNRLSRFLYNFY 

SEQ XD NO. 6913 

STRAIN 090 frame: 2 

ETTLTVT YDQKSHTATSMKI ETPATNAAGQTPATVDLKTNQVSVADQKVSLNTI SEGMTP 
EAATT I VSPMKT YS S APALKSKEVLAQEQAVSQAAANEQVSTAP VKS I TSEVPAAKEEVK 
PTC/TS VSQSTTVS PASVAAETPAPVAKVAP VRTVAAPRVAS VKWTPKVETGAS PEHVSA 
PAVPVTTTSTATDSKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVA 
AYKEKVAST YGVNE FSTYRAGDPGDHGKGLAVDF I VGKNQALGNE VAQYSTQNMAANN I S 
YVI WQQKFYSNTNS I YGPANTWNAMPDRGGVTANHYDHVHVSFNK . YKKGS YLAS FL YAL 
NRLSRFLYNFY 

SEQ ZD NO. 6914 

STRAIN A909 frame: 3 

DLVKQDNKS S YTVKYGDTL S VI SEAMS IDMNVIiAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVS VADQKVS LNT I SEGMTPEAATT I VS PMKT Y 
SSAPALKSKEVLAQGQAVSQAAANEQVS PAPVKS I TSEVPAAKEEVKPTQTSVSQSTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSTATD 
S KLQATE VKS VP VAQKA PTAT P VAQ P A STTNAVAAH P ENARLQPHVAAY KE KVAS TYGVN 
E FSTYRAGDPGDHGKGLAVDF I VGKNQALGNE VAQYSTQNMAANNI S YVI WQQKFYSNTN 
SI YGPANTWNAMPDRGGVTANHYDHVHVSFNK . YKKGS YLAS FLYALHRLSRFLYNFY 

SEQ ID NO. 6915 
STRAIN H36B frame: 3 

DLVKQDNKS SYTVKYGDTLS VI SEAMS IDMNVLAKINNI ADINL I YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVS VADQKVS LNT I SEGMTPEAATTI VSPMKT Y 
S SAPALKS KEVLAQGQAVS QAAANEQVSPAPVKS I TSEVPAAKEEVKPTQTSVSQSTTVS 
PAS VAAE TPAP VAKVAP VRT VAAP R VAS VKWT P KVETGA S P EHVS AP AVP VTTT S TATD 
S KLQATE VKS VP VAQ KAPTAT P VAQ PASTTNAVAAHPENARL QPHVAAYKE KVAS TYGVN 



995 



WO 2004/018646 



PCT/US2003/026827 



Table 69: Comparative Sequences relating to SAG0032 



E FST YRAGD PGDHGKGLAVD F I VGKNQALGNE VAQYSTQNMAANN I S YV I WQQKFYSNTN 
S I YGPANTWNAM PDRGG VTANH YDHVHVS FNK . YKKGSYLASFLYALHRLSRFLYNFY 

SEQ ID NO. 6916 
STRAIN 18RS21 frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVS VADQKVSLNT I S EGMT PEAATT I VS PMKT Y 
SSAPALKSKEVIJVQEQAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTO/rSVSQSTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSPATD 
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN 
EF STYRAGD PGDHGKGLAVDF I VGTNQALGNKVAQYSTQNMAANN I S YV I WQQKF YSNTN 
S I YGPANTWNAMPDRGGVTANH YDHVHVS FNK . YKKGSYLAS FLYALNRLSRFLYNF Y 

SEQ ID NO. 6917 
STRAIN M732 frame: 3 

DLVKQDNKS SYTVKYGDTXS VI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVFVADQKVSLNTISEGMTPEAATTIVSPMKT 
S SAPALKS KE VLAQEQAVS Q VAANEQVSPAPVKS I TSEVPAAKEE VKPTQTS VSQLTTVS 
PAS VAAETPAP VAKVAP VRTVAAPRVA S AKWTP KVETGAS P E HVS APAVP VTTT S PATD 
S KLQ ATE VKS VP VAQ KAPTAS P VAQ PA S TTNAVAAHP ENAGL Q PHVAAYKE KVAS T YGVN 
EFSTYRAGD PGDHGKGLAVDF I VGKNQALGNEVAQYSTQNMAANNI SYVI WQQKFYSNTN 
SI YGPANTWNAM PDRGG VTANHYDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6918 
STRAIN COH1 frame: 3 

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVFVADQKVSLNT I SEGMTPEAATT I VS PMKTY 
S SAPALKSKE VLAQEQAV5QVAANEQVSPAP VKS I TSEVPAAKEEVKPTQT S VSQ LTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASAKVVTPKVETGASPEHVSAPAVPVTTTSPATD 
S KLQATE VKS VP VAQKAPT ATP VAQ PASTTNAVAAHPENAGLQPHVAAYKE KVAS TYGVN 
EFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNI SYVI WQQKFYSNTN 
SI YGPANTWNAM PDRGG VTANHYDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6919 
STRAIN M781 frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ET PATNAAGQTT ATVDL KTNQVFVADQ KVS LNT I SEGMTPEAATT I VS PMKTY 
S SAPALKSKEVLAQEQAVSQVAANEQVS PAPVKS I TSEVPAAKEE VKPTQTS VS QLTTVS 
PAS VAAE T PAP VAKVAP VRTVAAP RVAS AKWT P KVETGAS PEHVS APAVP VTTT S PATD 
S KLQ ATE VKS VP VAQ KAPTATPVAQ PAS TTNAVAAHP ENAGLQPHVA^YKEKVASTYGVN 
EFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSNTN 
S I YGPANTWNAM PDRGG VTANHYDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6920 
STRAIN CJBUO frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGGTPATVDLKTNQVS VADQKVS LNT I SEGMTPEAATT I VS PMKTY 
SSAPALKSKEVLAQEQAVSQAAANEQVSTAPVKS I TSEVPAAKEE VKPTQTS VSQSTTVS 
PAS VAAE TPAP VAKVAP VRTVAAP RVAS VKWT P KVETGAS PEHVS APAVP VTTTSTATD 
S KLQATEVKS VP VAQKAPTAT P VAQ PAS TTNAVAAH PENAGLQPHVAAYKEKVAS TYGVN 
EFSTYRAGD PGDHGKGLAVD F I VGKNQALGNEVAQ YSTQNMAANN I SYVI WQQKFYSNTN 
S I YGPANTWNAMPDRGG VTANHYDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6921 
STRAIN 1 169NT frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVSVADQKVSLNTI SEGMTPEAATT I VS PMKTY 
S SAPALKS KEVLAQEQAVS QAAANEQVS PAPVKS ITSEVPAAKEE VRPTQTS V SQST TVS 
PASVAAETPAP VAKVAP VRTVAAP APRVA SAKVVT P KVETGA S PEHVPAPAVP VTTT S TA 
TDNKLQATE VKS VP VAQ KAPTATPVAQ PAS TTNAVAAHPENAGLQ PHVAAYKE KVAS TYG 
VNEFSTYRAGDPGDHGKGLAVDFI VGKNQALGNEVAQYSTQNMAANNI SYVI WQQKF YSN 
TNS I YGPANTWNAM PDRGG VTANHYDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6922 
STRAIN JM9 1300 13 frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS IDMNVLAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ET PATNAAGQTTATVDLKTNQVS VADQKVSLNT I SEGMTPEAATT I VS PMKTY 
SSAPALKSKEVLAQEQAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTQTSVSQSTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGAS 

SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN 
EFSTYRAGD PGDHGKGLAVDF I VGTNQALGNKVAQYSTQNMAANN I SYVI WQQKFYSNTN 
S I YGPANTWNAM PDRGG VTANH YDHVHVS FNK . YKKGSYLASFLYALNRLSRFLYNFY 

PRETTY Of : /biotmp/msa237 049 . 2 {*} May 14, 2003 03:04 



1 50 

msa237049.2{322_COHl} dlvkqdnkss 

msa237049 -2{322_M78l} dlvkqdnkss 

msa237049.2{322_M732} dlvkqdnkss 

msa237049.2{322_A909) dlvkqdnkss 

msa237049.2{322_H36B} dlvkqdnkss 

msa237049.2{322_090} 



996 



WO 2004/018646 



PCT/US2003/026827 



Table 69: Comparative Sequences relating to SAG0032 



msa237049 .2 {322__CJB110} dlvkqdnkss 

msa237Q49 .2{322_18RS2l} dlvkqdnkss 

msa237049 .2{322_2603} mnkkvlltst maasllsvas vqaqetdttw tartvsevka dlvkqdnkss 

msa237049 . 2 { 322_JM9130 013 } • dlvkqdnkss 

msa237049 .2{322_1169NT} dlvkqdnkss 

Consensus ********** ********** ********** ********** 



msa237049 . 

msa237049 . 

msa237049 . 

rasa237049 . 

msa237049 . 
msa237049 
msa237049.2{ 
msa237049.2{ 

msa237049 . 
msa237049.2{322 
msa237049.2{ 



2f 322_COHl} 
2{322_M78l} 
2{322_M732} 
2{322_A909} 
2{322_H36B} 
.2{322_090} 
322_CJB110} 
322_18RS2l) 
2{322_2603} 
JM9130013} 
322_1169NT} 
Consensus 



51 

ytvkygdtls 
ytvkygdtls 
ytvkygdtxs 
ytvkygdtls 
ytvkygdtls 



vis earns i dm nvlakinnia 
viseamsidm nvlakinnia 
vis earns idm nvlakinnia 
viseamsidm nvlakinnia 
viseamsidm nvlakinnia 



ytvkygdtls 
ytvkygdtls 
ytvkygdtls 
ytvkygdtls 
ytvkygdtls 



viseamsidm nvlakinnia 
viseamsidm nvlakinnia 
viseamsidm nvlakinnia 
viseamsidm nvlakinnia 
viseamsidm nvlakinnia 



dinliypETT 
dinliypETT 
dinliypETT 
dinliypETT 
dinliypETT 

ETT 

dinliypETT 
dinliypETT 
dinliypETT 
dinliypETT 
dinliypETT 



100 

LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 
LTVTYDQKSH 



_*** ********** 



msa237049.2{322_COHl} 
msa237049 .2l322_M78l} 
msa237049.2{322_M732} 
msa237049.2{322_A909} 
msa237049 . 2 { 322JK36B} 
msa237049.2{322_090} 
msa237049.2{322_CJB110} 
msa237049.2{322_18RS2l} 
msa237049.2{322_2603} 
msa237049.2{322_JM9130013} 
msa237049.2{322_1169NT} 
Consensus 



101 

TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKI ETP 
TATSMKIETP 
TATSMKI ETP 



ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTpA 
ATNAAGQTpA 
ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTtA 
ATNAAGQTtA 



TVDLKTNQVf 
TVDLKTNQVf 
TVDLKTNQVf 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 
TVDLKTNQVs 



********** ********_* *********_ 



VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
VADQKVSLNT 
********** 



150 

I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
I SEGMTPEAA 
********** 



msa237049.2{322_COHl} 
msa237049.2{322_M78l} 
msa237049.2{322_M732} 
msa237049 . 2 (322_A909) 
msa237049.2{322_H36B} 
msa237049.2{322_090} 
msa237049.2{322_CJB110} 
msa237049 .2{322_18RS2l} 
msa237049.2{322_2 603} 
msa237049.2{322_JM9130 013} 
msa237049.2{322_1169NT} 
Consensus 



151 

TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 
TTIVSPMKTY 



SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 
SSAPALKSKE 



VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQgQAVSQ 
VLAQgQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 
VLAQeQAVSQ 



********** ********** ****_***** 



vAANEQVSpA 
vAANEQVSpA 
vAANEQVSpA 
aAANEQVSpA 
aAANEQVSpA 
aAANEQVS t A 
aAANEQVS t A 
aAANEQVSpA 
aAANEQVSpA 
aAANEQVSpA 
aAANEQVSpA 
.*******„* 



200 

PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
PVKSITSEVP 
********** 



msa237049.2{322_COHl} 
msa237049.2(322_M78l} 
msa237049.2(322_M732} 
msa237049.2{322_A909} 
msa237049.2{322_H36B} 
msa237049. 2 {322^090} 
msa237049 . 2 { 322__CJB110 } 
msa237049 . 2 {322_18RS2l} 
msa237049. 2 {322^2603} 
msa237049.2{322_JM9130013} 
msa237049 . 2 { 322_1169NT} 
Consensus 



201 

AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVkPTQ 
AAKEEVrPTQ 



TSVSQ1TTVS 
TSVSQITTVS 
TSVSQ1TTVS 
TSVSQSTTVS 
TSVSQsTTVS 
TSVSQsTTVS 
TSVSQsTTVS 
TSVSQSTTVS 
TSVSQsTTVS 
TSVSQsTTVS 
TSVSQsTTVS 



******_*** *****_**** 



PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
PASVAAETPA 
********** 



PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 
PVAKVAPVRT 



250 

VA. .APRVAS 
VA. .APRVAS 
VA. .APRVAS 
VA. .APRVAS 
APRVAS 
APRVAS 
APRVAS 
APRVAS 
VA. .APRVAS 
VA. .APRVAS 
VAapAPRVAS 



VA. . 
VA. . 
VA. . 
VA. . 



********** **__****** 



msa237049.2(322_COHl 
msa237049.2(322_M78i; 
msa237049.2{322_M732 
msa237049.2{322_A909 
msa237049.2{322_H36B] 
msa237049.2{322_090) 
msa237049.2{322_CJB110> 
msa237049 . 2 { 322_18RS21 } 
msa237049.2{322_2603} 
msa237049 . 2 { 322_JM9130013 ) 
msa237049 . 2 {322_1169NT} 
Consensus 



251 

aKWTPKVET 
aKWTPKVET 
aKWTPKVET 
vKWTPKVET 
vKWTPKVET 
vKWTPKVET 
vKWTPKVET 
vKWTPKVET 
vKWTPKVET 
vKWTPKVET 
aKWTPKVET 



GASPEHVsAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVSAP 
GASPEHVsAP 
GASPEHVsAP 
GASPEHVpAP 



_********* *******_** 



AVPVTTTSpA 
AVPVTTTSpA 
AVPVTTTSpA 
AVPVTTTS t A 
AVPVTTTS t A 
AVPVTTTS t A 
AVPVTTTS t A 
AVPVTTTSpA 
AVPVTTTSpA 
AVPVTTTSpA 
AVPVTTTS t A 
********_* 



TDs KLQATEV 
TDs KLQATE V 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDs KLQATEV 
TDnKLQATEV 



300 

KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 
KSVPVAQKAP 



**_******* ********** 



msa237049 . 2 { 322_C0H1 } 
msa237049.2{322_M78l} 
msa237049.2{322_M732} 
msa237049.2f322_A909} 
msa237049.2{322_H36B} 



301 

TAtPVAQPAS 
TAtPVAQPAS 
TAsPVAQPAS 
TAtPVAQPAS 
TAtPVAQPAS 



TTNAVAAHPE 
TTNAVAAHPE 
TTNAVAAHPE 
TTNAVAAHPE 
TTNAVAAHPE 



NAgLQPHVAA 
NAgLQPHVAA 
NAgLQPHVAA 
NArLQPHVAA 
NArLQPHVAA 



YKEKVASTYG 
YKEKVASTYG 
YKEKVASTYG 
YKEKVASTYG 
YKEKVASTYG 



350 

VNEPSTYRAG 
VNEFSTYRAG 
VNE FSTYRAG 
VNEFSTYRAG 
VNEFSTYRAG 
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Table 69: Comparative Sequences relating to SAG0032 

msa237049 .2{322_090} TAtPVAQPAS TTNAVAAHPE NAgLQ PHVAA YKEKVASTYG VNE FSTYRAG 

msa237049.2{322_CJB110} TAtPVAQPAS TTNAVAAHPE NAgLQ PHVAA YKEKVASTYG VNE FSTYRAG 

msa237049.2{322_18RS2l} TAtPVAQPAS TTNAVAAHPE NAgLQ PHVAA YKEKVASTYG VNE FSTYRAG 

msa237049.2{322_2603} TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNE FSTYRAG 

tnsa237049.2{322_JM9130013} TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNE FSTYRAG 

msa237049.2{322_1169NT} TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNE FSTYRAG 

Consensus **-******* ********** **_******* ********** ********** 



rasa237049 . 

rasa237049 . 

msa237049 . 

msa237049 . 

rasa237049 . 
msa237049 
msa237049.2{ 
msa237049.2{ 

msa237049 . 
msa237049.2{322 
msa237049 .2{ 



2{322JZOHl} 
2{322_M78l} 
2{322_M732} 
2{322_A909] 
2{322_H36B] 
2{322_090] 
322_CJB110 
322_18RS2lj 
2{322_2603} 
_JM9130013} 
322_1169NT} 
Consensus 



351 

DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 
DPGDHGKGLA 



VDFIVGkNQA 
VDFIVGkNQA 
VDFIVGkNQA 
VDF I VGkNQA 
VDFIVGkNQA 
VDFIVGkNQA 
VDFIVGkNQA 
VDFIVGtNQA 
VDFIVGtNQA 
VDFIVGtNQA 
VDFIVGkNQA 



LGNeVAQYST 
LGNeVAQYST 
LGNeVAQYST 
LGNeVAQYST 
LGNeVAQYST 
LGNeVAQYST 
LGNeVAQYST 
LGNkVAQYST 
LGNkVAQYST 
LGNkVAQYST 
LGNeVAQYST 



********** ******_*** ***_****** 



QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 
QNMAANNISY 



400 

VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 
VIWQQKFYSN 



********** ********** 



msa237049 . 

msa237049 . 

msa237049 . 

msa237049. 

msa237049 . 
msa237049 
msa237049.2{ 
msa237049.2{ 

msa237049 . 
msa237049.2{322 
msa237049.2{" 



2f 322_COHl} 
2(322_M781} 
2{322_M732; 
2{322_A909" 
2{322_H36B) 
2{322_090. 
322_CJB110' 
322_18RS2l} 
2{322_2603) 
_JM9130013} 
322__1169NT} 
Consensus 



401 

TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
TNS I YGPANT 
********** 



WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
WNAMPDRGGV 
********** 



TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
TANHYDHVHV 
********** 



SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
SFNK.YKKGS 
********** 



450 

YLASFLYALn 
YLASFLYALn 
YLASFLYALn 
YLASFLYALh 
YLASFLYALh 
YLASFLYALn 
YLASFLYALn 
YLASFLYALn 
YLASFLYALn 
YLASFLYALn 
YLASFLYALn 
*********„ 



msa237049 . 

msa237049 . 

msa237049 . 

msa237049 . 

rasa237049 . 
msa237049 
msa237049 
msa237049 

msa237049 . 
msa237049.2{322 
msa237049.2{' 



2{322_COHl} 
2{322_M78l} 
2{322_M732} 
2{322_A909} 
2{322_H36B} 
.2{322_090} 
.2{322_CJB110) 
.2{322_18RS21} 
2 {322^2603} 
_JM9130013} 
322_1169NT} 
Consensus 



451 460 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
RLSRFLYNFY 
********** 
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Table 70: Comparative Sequences relating to SAG 1280 



SEQ ID. NO. 7001 
STRAIN 2603 

ATGGGAGGGAAAATG AAT CAAGAAGT CTTACTACAAATG ATGAG AG C CACT ATT C CT C 
GTGAT AG AG C CTTG CTTGAGGCATTTTTAT ATTAC CAAG CAGAGCATTTTGATGAGG AGT 
GGGATAGTCTTATTCATCAGTTTATGACCAATAGGCAAGAAATAAATAAGTCTGTTCAAG 
TACTTCACTTTGAGACAGATGTTT GAG CTTTTGTCCAGGCTAGTCCTTATGATACTGCTC 
ATGAT CTATTGAC CTAT ACACAAGTTTT CGG C CAAAGTGGT CTTCAAAAACTAG ATAAAC 
TATCGCCGTCTGAAAAAAACTTGGTGATAGAAGTGGCCTTGTTCAATCTGGCCACTCGTT 
TTG^TTATTGGATTCCAATGGACACrAGC^W^CCATATCGCCGGATTCACTCTTACAAA 
AGAGTAGGG G AG CTAATTT GGT CAATGTGTAT CGTGTGG CTAATAATTT AGCGGAT CGTA 
TTAGT CG AG ATATTGAACAGTTT CTCTTAACTTACGAGCCTG AG CTTGAAACTAGAG CTG 
ATGAAACTGTTCTAGAAAATGAAGAAACTGTTGATGAGCAGAAAACAAGTGTTCATCAAG 
CAATATCITTTCGAGAAGAGGGCTTCTCTGGTTATTGCTAGTTTGGATGTAGATTTGTCTC, 
AACTAGATGTTCAAATAGG AAAAACCAGTCAT CT G C CAG CTT ATGAAGAGTTAT C CTTAC 
GACGT AAATTTGAG ATT CTAACAT ATTTTG AC CAAATTCGAAATGAACGTTC CAAAGT CC 
CAAGTTTTAGACGAGGTGATTTTGAGAGAGAGATGGAAATGACACCAGT CTTTGATGG CG 
AGGAATTACI^ACTTATCTCGAAGCTGATGGCAGTCCCTATGAGCTGAAACGA^ 
CTACAGT CG AAG AAAAGGAATT AGAAAAAATTGGACAAG C CATTAGGAT AGAAAATCAAG 
AAAAATTGACT CAG CTAGGGATTGATTTAT CT CAGTTTG AC C CAG AC CG AGT CGGTATTT 
TATTGGATGCAGCAGGTCGTTTTCGTTTAAAAAATGCAGACCTTGCTTT 
ATCC CAAAG C CT CGGTAACT CAACTAG C C CTTGCGACAGAACTACT C CAAATGGGACTAA 
GTCATGAAAAGGTTGAATTTTT CTTTGGT AG C CAGcri" I'C CATTG AAGAG CTG CG ACAAG 
TTGC CTACG CCTTTTTATAC CAAGAACTCAGCAGAGAAGATG CGGAG CAATTTGAAAAAG 
ATAAAGGTAATCAG C CAGATTTAACTCT CAGAGATTGGAAAAGCAAG CT AGAGAAAG CTG 
AGGGAAAAGAAGTAGTTGATGAAGAATTCG CX3GAAAATC CACTGGTT CAGAGAGTATTGG 
ACACT^ATCCrCTGGGGTCATTGGTTTCCTATAAGGGACAG^ 
TCAGCGATGCTCGATTGAACGGTTTGATTCGGATTGA^ 

TCATTGAACAAAAT CCAGTT CTTTATGTGAGGACCTGGGAAGAAGTCAGTCAGG CACTTC 
ATGAG CCAAAGG CAGAACCACAAACAGAGTTAGAAGAAG CGGACCAAGAAT^ 
TCTCATTTCTGGAAGAGGAGCCAGTTCAGAGTATTG<3ACT 

AAAATGGT CATAACGATACTGATCTTGAAGAAACAGATAAT CAAATT CCTGAAGAGGAAG 

TCGT CGAAACAATTC CAGAGATTC CAGTAACGGACl"ri"l ATT TT CCAGAAGATTTGACGG 

ACTTTTATCCTAAGACTGCTAGAGATAAGGTTGAGACAAACAT^ 

TAAAAAATCTAGAAGTAC^GCACCGCAATGCTTCACCAAGTGAACAAC^ 

AGTATGTAGG CTGGG GTGGACTAG CCAATGAATTTTTTGATGACTATAAT CCAAAATTTT 

CTAAGGAACGAGAAGAACTGAAGAGCCTAGTCACAGATAAAGAGTATTCGGATATGAAAC 

AGTC(2TCCCTC^C!AGCCrATTACACAGACCCATCCCTGATCCGTCA.GATGTGGGATAAGT 

TGGAAAGAGATGGCTTTACAG^TGGCAAAAT CCTAG^T CCATGGGAACAGGGAATT 

TCTTTGCGGCTATGCCAAAAC^CrTAAGAGAAAAGAGTGAGTTGTATGGCGTAGAG 

ATACTATTACAGGAGCTATTGCCAAACACCrTCATC 

GATTTGAGACGGTGGCTTTTAACGACAATAGTTC 

TTGCCAATATACGAATTGCGGATAATAGGTACGATAGGCCTTACATGATTCATGACTACT 
TTGT CAAAAAGT CACTTGATTT GCTTCATGATGGTGGACAAGTAG CGATTAT CT CTTC CA 
CAGGAACTATGGATAAG CG AACAG A^AACATCITACAAG ATATT CGTGAG ACAACTGAAT 
TTCTTGGTGGGGTTCGACTGCCTGACTCrrGCCTTTAAGGCC^ 
GAACGGATATGTTATTCTTCCRGAAACACTTAGACAAGGGATA 
CCTTTTCAGGTTCCATTCG CTATGACAAGGATAGTCG CATT^ 

ATGGAGAATACAATAG C CAGGTG CTAGGAAC CTAC GAGGT CAGG AATTTT A^CGGAG GAA 
CACTTTCTGTTAAGGGGACTAGTGATGACTT 

ACGTTAAGG CCC CAAGAGAG ATTGATAGAAATGAGGT CAT CATTAAC CCAGATGTGTTG A 
CCAAACAAGTCAATGATAC CTCCATT C CAG CTGAAATGAGGGAAAAT CTAGGTCAGTACA 
GTTTTGGTTATCAGGGGTCTACAGTTTACTATCGAGATAACAAAGG CATT CGAGT CGGAA 
CCAAG ACG<?AAGAAATCAGTTACTATGTCG ATGAAGAGGG CAACTTCAAAG CATGGGACA 
CCAAACATTCTCAAAAG CAG ATTGAT CGCITTAATGC CTTAGAAGTGACTGATAAC^ 
CT CTGGATGTCTATGTG AC CGATGATG CAG CCAAACGTGGTCAGTTTAAGGGGTATT ATA 
AAAAGACAGTTTTCTATGAAGCTC(2ATTGTCTT , ATAAAGAAGTGGCA 
TGGTCGATATTCGCAATGCCTACCAAGAAGTTATTGCCATTCAACGCTATTAT^ 
ATAAGGAGACCTTTAACCACTT'GTTAGGCAAACTCAATCGTACCTATG 
AACACTATGGGTATTTGAATAGTG CTGTGAAC CG CAATCTTTTTG ATAGTGATGATAAGT 
ATTC G CTTCTTG CT AGTTTG GAAG ATGAAAGT CTGGATC CAAGTGGAAAGT CTGTTATCT 
ATACTAAAT CC CTTG CCTTTGAGAAGGCT CTAGTG CGTCCTGi^ 
TGCATACJTGCCCTTGATGCCTTAAATTCGAGCTITGGCT 

CTTATATGATGT CTAT CTAT CAGGTTGAATCGCAGATGAC CTTG ATTGAGGAGTTAGGCG 
AC CT CATTATG CCTG AT CCTGAGAAGTATTTGAATGG AGAATTG ACCTAT GTTT CT CG C C 
AAGACTTTCTTT CAGGGGATGT CGTCACTAAGTTAGAAGTGGTAGAT CTATTCGT CAAAC 
AAGACAATCAGGACTTTAACTGGT CACATTAT G CG GGACTTCTAGAAGCTAT CAAAC CAG 
CCCGTATTACTTTGGCAGACATTGATTATCGAATCGGTTC^ 

TTTATGGAAAATTTG CC C^GAAACCTTTATGGGGAAAG C CTAT G AACTGTCAGACCAAG 

AAGTAGCGACAGTCCTAGAAGTCAGTCCCATTGACGGGGTTATCACT^ 

TTGC CTACACCTATT C CAACG CAACGG ATAGGAGTTTAGGTGTC C CTG CTTCACG CTATG 

ATAGTG^TCGAAAAATCTTTGAAAATCrCCTGAATTCCAATCAACCAACC^ 

AAGTTGTCGAAGGGGATAAGAAAAAGAATGTGACGGATGTAGAGAAAACAACGGTCCTGC 

GTGCCAAGGAAACACACCT ACAAGAACT CTTT CAAGGTTTTGTAG CAAAGTAT CCAGAAG 

TCCAACAAATGATTGAAGACACCTATAATAGGCTCTACAATCGTAC^^ 

ATGATGGTAGTCATTTAACCATTGATGGACTTGCTCAG^AATATCT 

AAAAGAATGCCATTCAACGAATTGTCXJAGGAAAAACGTGCTCTACT 

GTT CAC^TAAAACACTTAC CATGCTTGGGG CAGGATT CAAACTGAAAGAACTCGGAATGG 
TAC^ATAAACCACTTTATGTGGTGC CGTCTAGT CTG ACTG CT CAGTTTGGT CAAGAAAT CA 
TGAAATTTTT CCCT ACCAAG AAAGT CTATGTGACTACTAAGAAAGACTTTGC CAAAG C CA 
AACG CAAG CAGTTTGTGTC C CGTATTATTACAGGGGACTATGATG CCATTGT CATTGGGG 
ATTCACAATTTGAGAAGAT ACCGATGAGT C GTGAAAAACAGGT CACCTAT AT CAATG ACA 
AACTTGAG CAACTCCGAGAAAT CAAG CTAGGAAGTGACAGTGATTACACGGTGAAAG AAG 
CXXmACGTTCC^TTAAGGGATTAC^CACCAGCTGGAAGAACTCCAAAAACTAGAGCGAG 
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